Citrus Sinensis ID: 027891


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------
MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVTASASMPQSEPVKRKRGRPRKYGPDGSVSLALSPSVSTHPGTISPTQKRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVYG
cccccccccccccccEEcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEccccccHHHHHHHHHHccccEEEEEcccccEEEEEEcccccccccEEEEEEEEEEcc
ccccccEcccccccEEEEccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHcccccccccccccccccccccccccccccccccHccccccccccccccccHHHccccccccccccccEEEEEcccHHHHHHHHHHHHccccEEEEEEEccEEEEEEEEcccccccEEEEEcEEEEEEc
mdrrdglalpgsasfymqrgmtgsgsgtqpslhgspgihplsnpslqfqsniggstigstlsvdpssaisphgvnvtasasmpqsepvkrkrgrprkygpdgsvslalspsvsthpgtisptqkrgrgrppgtgrkqQVSSlgeslsgsagmgftphVITVAVGEDIAMKLLSFSQQGPRAICVLSangaistatlrqpsssggsvtyecpflevyg
mdrrdglalpgSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGvnvtasasmpqsepvkrkrgrprkygpdgsvslalspsvsthpgtisptqkrgrgrppgTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAIStatlrqpsssggsvtyeCPFLEVYG
MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVTASASMPQSEpvkrkrgrprkygpdgSVSLALSPSVSTHPGTISptqkrgrgrppgtgrkqqVsslgeslsgsAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVYG
*******************************************************************************************************************************************************MGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAIST*************TYECPFL****
****************************************************************************************************************************************************SAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVYG
********LPGSASFYMQRGM*************SPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVT**************************VSLALSP*************************************GSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTAT**********VTYECPFLEVYG
*********P***SFY***GMTGSGSGTQPSLHGSPGIHPLSNP**********************************************************S*SL***********************************LGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVYG
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGSTLSVDPSSAISPHGVNVTASASMPQSEPVKRKRGRPRKYGPDGSVSLALSPSVSTHPGTISPTQKRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVYG
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query217 2.2.26 [Sep-21-2011]
Q9S7C9 311 Putative DNA-binding prot no no 0.336 0.234 0.368 8e-05
>sp|Q9S7C9|ESCA_ARATH Putative DNA-binding protein ESCAROLA OS=Arabidopsis thaliana GN=ESC PE=2 SV=1 Back     alignment and function desciption
 Score = 47.4 bits (111), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 42/76 (55%), Gaps = 3/76 (3%)

Query: 124 KRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGPRAIC 183
           KR RGRPPG+  K +   +    S +A      HV+ V+ G DI   + +++++  R + 
Sbjct: 86  KRPRGRPPGSKNKAKPPIIVTRDSPNA---LRSHVLEVSPGADIVESVSTYARRRGRGVS 142

Query: 184 VLSANGAISTATLRQP 199
           VL  NG +S  TLRQP
Sbjct: 143 VLGGNGTVSNVTLRQP 158





Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query217
255541558 340 DNA binding protein, putative [Ricinus c 0.976 0.623 0.774 2e-88
133907524 340 AT-hook DNA-binding protein [Gossypium h 0.976 0.623 0.732 2e-84
225454180 345 PREDICTED: uncharacterized protein LOC10 0.976 0.614 0.764 4e-81
224130232 336 predicted protein [Populus trichocarpa] 0.953 0.616 0.713 3e-75
358249184 341 uncharacterized protein LOC100814615 [Gl 0.953 0.607 0.688 1e-72
224067876 328 predicted protein [Populus trichocarpa] 0.926 0.612 0.665 2e-70
449518609 334 PREDICTED: uncharacterized LOC101203138 0.944 0.613 0.640 1e-69
449441474 334 PREDICTED: uncharacterized protein LOC10 0.944 0.613 0.640 1e-69
356568374 342 PREDICTED: uncharacterized protein LOC10 0.953 0.605 0.665 2e-69
356520420 340 PREDICTED: uncharacterized protein LOC10 0.967 0.617 0.650 6e-68
>gi|255541558|ref|XP_002511843.1| DNA binding protein, putative [Ricinus communis] gi|223549023|gb|EEF50512.1| DNA binding protein, putative [Ricinus communis] Back     alignment and taxonomy information
 Score =  330 bits (847), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 165/213 (77%), Positives = 191/213 (89%), Gaps = 1/213 (0%)

Query: 1   MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSPGIHPLSNPSLQFQSNIGGSTIGST 60
           MDRRD +A+ GSASFYMQRGMTGSGSGTQ  L+ S GI+PL++ ++ FQSN+G +TIGST
Sbjct: 1   MDRRDAMAMSGSASFYMQRGMTGSGSGTQSGLNVSSGINPLTSTNVSFQSNVGANTIGST 60

Query: 61  LSVDPSSAISPHGVNVTASASMPQ-SEPVKRKRGRPRKYGPDGSVSLALSPSVSTHPGTI 119
           L ++ S+AI PHGVNV AS+ MP   EPVKRKRGRPRKYGPDG+VSLALSPS+STHPGTI
Sbjct: 61  LPLETSTAIPPHGVNVGASSLMPPPGEPVKRKRGRPRKYGPDGTVSLALSPSLSTHPGTI 120

Query: 120 SPTQKRGRGRPPGTGRKQQVSSLGESLSGSAGMGFTPHVITVAVGEDIAMKLLSFSQQGP 179
           +PTQKRGRGRPPGTGRKQQ++SLGE LSGSAGMGFTPH+IT+AVGEDIA K++SFSQQGP
Sbjct: 121 TPTQKRGRGRPPGTGRKQQLASLGEWLSGSAGMGFTPHIITIAVGEDIATKIMSFSQQGP 180

Query: 180 RAICVLSANGAISTATLRQPSSSGGSVTYECPF 212
           RAIC+LSANGA+ST TLRQPS+SGGSVTYE  F
Sbjct: 181 RAICILSANGAVSTVTLRQPSTSGGSVTYEGRF 213




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|133907524|gb|ABO42262.1| AT-hook DNA-binding protein [Gossypium hirsutum] Back     alignment and taxonomy information
>gi|225454180|ref|XP_002272142.1| PREDICTED: uncharacterized protein LOC100265498 [Vitis vinifera] gi|297745264|emb|CBI40344.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224130232|ref|XP_002320785.1| predicted protein [Populus trichocarpa] gi|222861558|gb|EEE99100.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|358249184|ref|NP_001239751.1| uncharacterized protein LOC100814615 [Glycine max] gi|255636132|gb|ACU18409.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|224067876|ref|XP_002302577.1| predicted protein [Populus trichocarpa] gi|222844303|gb|EEE81850.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|449518609|ref|XP_004166329.1| PREDICTED: uncharacterized LOC101203138 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449441474|ref|XP_004138507.1| PREDICTED: uncharacterized protein LOC101203138 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356568374|ref|XP_003552386.1| PREDICTED: uncharacterized protein LOC100802542 isoform 1 [Glycine max] gi|356568376|ref|XP_003552387.1| PREDICTED: uncharacterized protein LOC100802542 isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|356520420|ref|XP_003528860.1| PREDICTED: uncharacterized protein LOC100799791 [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query217
TAIR|locus:2050766 348 AT2G45850 [Arabidopsis thalian 0.944 0.589 0.422 2.6e-31
TAIR|locus:2098861 354 AT3G61310 [Arabidopsis thalian 0.944 0.579 0.417 6.9e-31
TAIR|locus:2031306 361 AT1G63480 [Arabidopsis thalian 0.290 0.174 0.571 3.1e-23
TAIR|locus:2031321 378 AT1G63470 [Arabidopsis thalian 0.290 0.166 0.539 1.5e-21
TAIR|locus:2051038 351 AT2G33620 [Arabidopsis thalian 0.285 0.176 0.629 1.1e-20
TAIR|locus:2118091 356 AHL1 "AT-hook motif nuclear-lo 0.285 0.174 0.709 7.8e-20
TAIR|locus:2126946 318 AT4G00200 [Arabidopsis thalian 0.285 0.194 0.709 1.5e-19
TAIR|locus:2132599 334 AT4G22770 [Arabidopsis thalian 0.290 0.188 0.619 7.9e-17
TAIR|locus:2153142 419 AHL4 "AT-HOOK MOTIF NUCLEAR LO 0.271 0.140 0.661 2.1e-16
TAIR|locus:2122684 404 AHL3 "AT-HOOK MOTIF NUCLEAR LO 0.285 0.153 0.629 2.5e-16
TAIR|locus:2050766 AT2G45850 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 344 (126.2 bits), Expect = 2.6e-31, P = 2.6e-31
 Identities = 96/227 (42%), Positives = 121/227 (53%)

Query:     1 MDRRDGLALPGSASFYMQRGMTGSGSGTQPSLHGSP----GIHPLSNPSLQFQSNIGGST 56
             MDRRD + L GS S+Y+ RG++GSG    P+ HGSP    G+  L N +  F S  G + 
Sbjct:     1 MDRRDAMGLSGSGSYYIHRGLSGSGP---PTFHGSPQQQQGLRHLPNQNSPFGS--GSTG 55

Query:    57 IGS-TLSVDPSSAIS-------PH--GVNVTASASMPQSEXXXXXXXXXXXXXXXXSVSL 106
              GS +L  DPS A +       PH  GVN+ A    P                   SVSL
Sbjct:    56 FGSPSLHGDPSLATAAGGAGALPHHIGVNMIAPPPPPSETPMKRKRGRPRKYGQDGSVSL 115

Query:   107 ALSPS-VSTHPGTISXXXXXXXXXXXXXXXXXXVXXXXXXXXXXAGMGFTPHVITVAVGE 165
             ALS S VST   T +                  +          +GM FTPHVI V++GE
Sbjct:   116 ALSSSSVSTI--TPNNSNKRGRGRPPGSGKKQRMASVGELMPSSSGMSFTPHVIAVSIGE 173

Query:   166 DIAMKLLSFSQQGPRAICVLSANGAISTATLRQPSSSGGSVTYECPF 212
             DIA K+++FSQQGPRAICVLSA+GA+STATL QPS+S G++ YE  F
Sbjct:   174 DIASKVIAFSQQGPRAICVLSASGAVSTATLIQPSASPGAIKYEGRF 220




GO:0003677 "DNA binding" evidence=IEA;ISS
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0008150 "biological_process" evidence=ND
GO:0010103 "stomatal complex morphogenesis" evidence=RCA
TAIR|locus:2098861 AT3G61310 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031306 AT1G63480 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031321 AT1G63470 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2051038 AT2G33620 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118091 AHL1 "AT-hook motif nuclear-localized protein 1" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2126946 AT4G00200 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2132599 AT4G22770 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2153142 AHL4 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 4" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2122684 AHL3 "AT-HOOK MOTIF NUCLEAR LOCALIZED PROTEIN 3" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
GSVIVG00026831001
SubName- Full=Chromosome chr15 scaffold_40, whole genome shotgun sequence; (345 aa)
(Vitis vinifera)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query217
pfam03479120 pfam03479, DUF296, Domain of unknown function (DUF 6e-15
cd11378113 cd11378, DUF296, Domain of unknown function found 2e-11
>gnl|CDD|217587 pfam03479, DUF296, Domain of unknown function (DUF296) Back     alignment and domain information
 Score = 68.0 bits (167), Expect = 6e-15
 Identities = 26/62 (41%), Positives = 33/62 (53%), Gaps = 3/62 (4%)

Query: 154 FTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPS---SSGGSVTYEC 210
             PHV+ +  GED+   L +F++Q      VLS  GA+S  TLRQP     S G VT E 
Sbjct: 1   GRPHVLRLEPGEDLVESLEAFARQRGIGAAVLSGIGAVSNVTLRQPDEEAKSYGVVTLEG 60

Query: 211 PF 212
            F
Sbjct: 61  RF 62


This putative domain is found in proteins that contain AT-hook motifs pfam02178, which strongly suggests a DNA-binding function for the proteins as a whole. There are three highly conserved histidine residues, eg at 117, 119 and 133 in Reut_B5223, which should be a structurally conserved metal-binding unit, based on structural comparison with known metal-binding structures. The proteins should work as trimers. Length = 120

>gnl|CDD|211390 cd11378, DUF296, Domain of unknown function found in archaea, bacteria, and plants Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 217
PF03479120 DUF296: Domain of unknown function (DUF296); Inter 99.36
COG1661141 Predicted DNA-binding protein with PD1-like DNA-bi 98.39
PF0217813 AT_hook: AT hook motif; InterPro: IPR017956 AT hoo 96.29
smart0038426 AT_hook DNA binding domain with preference for A/T 96.28
PF14621219 RFX5_DNA_bdg: RFX5 DNA-binding domain 83.9
PF14869133 DUF4488: Domain of unknown function (DUF4488) 80.06
>PF03479 DUF296: Domain of unknown function (DUF296); InterPro: IPR005175 This putative conserved domain is found in proteins that contain AT-hook motifs IPR000637 from INTERPRO, suggesting a DNA-binding function for the proteins as a whole, however, the function of this domain is unknown Back     alignment and domain information
Probab=99.36  E-value=6.4e-13  Score=104.88  Aligned_cols=63  Identities=27%  Similarity=0.283  Sum_probs=56.7

Q ss_pred             ceeEEEEecCCccHHHHHHHHhhcCCceEEEecccCeeeeEEeeCCC--CCCCceeEeeeeeeec
Q 027891          154 FTPHVITVAVGEDIAMKLLSFSQQGPRAICVLSANGAISTATLRQPS--SSGGSVTYECPFLEVY  216 (217)
Q Consensus       154 f~pHVI~V~~GEDV~~kI~~Faqq~~raiCILSasGsVSnVTLRqp~--ssg~tvtyeGrFEILS  216 (217)
                      ||+|+|++.+||||.++|.+||++.....|++++.|+|++|+|+++.  ....+.+|+|+|||||
T Consensus         1 ~r~~~~rl~~Gedl~~~l~~~~~~~~i~~~~is~iGsl~~~~l~~~~~~~~~~~~~~~g~~Ei~s   65 (120)
T PF03479_consen    1 GRVFVIRLDPGEDLLESLEAFAREHGIRSGVISGIGSLSNVTLGYYDPPSYYEPLEFEGPFEIIS   65 (120)
T ss_dssp             EEEEEEEEETTSBHHHHHHHHHHHHT-SSEEEEEEEEEEEEEEEEEETTTEEEEEEEESEEEEEE
T ss_pred             CcEEEEEECCCCHHHHHHHHHHHHCCCcEEEEEEEeEEeEEEEEEecccCCcceEEecccEEEEE
Confidence            68999999999999999999999999878888999999999999973  3346789999999997



Overexpression of a protein containing this domain, Q9S7C9 from SWISSPROT, in Arabidopsis thaliana causes late flowering and modified leaf development []. ; PDB: 2DT4_A 2P6Y_A 3HWU_A 3HTN_A 2NMU_A 2H6L_A 2HX0_A.

>COG1661 Predicted DNA-binding protein with PD1-like DNA-binding motif [General function prediction only] Back     alignment and domain information
>PF02178 AT_hook: AT hook motif; InterPro: IPR017956 AT hooks are DNA-binding motifs with a preference for A/T rich regions Back     alignment and domain information
>smart00384 AT_hook DNA binding domain with preference for A/T rich regions Back     alignment and domain information
>PF14621 RFX5_DNA_bdg: RFX5 DNA-binding domain Back     alignment and domain information
>PF14869 DUF4488: Domain of unknown function (DUF4488) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query217
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 9e-11
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 6e-10
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 8e-06
1qzv_F154 Plant photosystem I: subunit PSAF; photosynthesis, 4e-05
>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Length = 154 Back     alignment and structure
 Score = 57.4 bits (138), Expect = 9e-11
 Identities = 12/67 (17%), Positives = 26/67 (38%), Gaps = 2/67 (2%)

Query: 147 SGSAGMGFTPHVITVAVGEDIAMKLLSFSQQ-GPRAICVLSANGAISTATLRQPSSSGGS 205
           S         + + +  G+++  +L +F QQ   RA  +    G+++   LR       +
Sbjct: 11  SHHNASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYA-GQEAT 69

Query: 206 VTYECPF 212
            +    F
Sbjct: 70  TSLTGTF 76


>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Length = 142 Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Length = 143 Back     alignment and structure
>1qzv_F Plant photosystem I: subunit PSAF; photosynthesis,plant photosynthetic reaction center, peripheral antenna; HET: CL1 PQN; 4.44A {Pisum sativum} SCOP: i.5.1.1 Length = 154 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query217
2hx0_A154 Putative DNA-binding protein; NESG, PSI-2, SCR59, 99.48
2p6y_A142 Hypothetical protein VCA0587; NESG, Q9KM02_vibch, 99.48
2dt4_A143 Hypothetical protein PH0802; PPC domain, structura 99.28
2h6l_A146 Hypothetical protein; NESG GR103, structural genom 99.27
3htn_A149 Putative DNA binding protein; DUF269 family protei 99.15
3hwu_A147 Putative DNA-binding protein; YP_299413.1, structu 98.35
2ezd_A26 High mobility group protein HMG-I/HMG-Y; DNA bindi 96.85
4i95_A142 Putative uncharacterized protein; lipocalin-like d 87.8
>2hx0_A Putative DNA-binding protein; NESG, PSI-2, SCR59, structural genomics, protein structure initiative; 1.55A {Salmonella choleraesuis} SCOP: d.290.1.3 PDB: 2nmu_A Back     alignment and structure
Probab=99.48  E-value=7.6e-14  Score=113.64  Aligned_cols=67  Identities=16%  Similarity=0.258  Sum_probs=61.5

Q ss_pred             CCCCCceeEEEEecCCccHHHHHHHHhhcCC-ceEEEecccCeeeeEEeeCCCCCCCceeEeeeeeeec
Q 027891          149 SAGMGFTPHVITVAVGEDIAMKLLSFSQQGP-RAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVY  216 (217)
Q Consensus       149 s~g~~f~pHVI~V~~GEDV~~kI~~Faqq~~-raiCILSasGsVSnVTLRqp~ssg~tvtyeGrFEILS  216 (217)
                      +..+.+++|+|+|.+||||.++|.+||++.. ++.|||++.|+|++|+||+++... +++|+|+|||||
T Consensus        13 ~~~~~~r~~vlrL~~Gedl~~~i~~~~~~~gi~~a~v~s~iGsl~~~~l~~~~~~~-~~~~~g~~EIls   80 (154)
T 2hx0_A           13 HNASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-TTSLTGTFEVIS   80 (154)
T ss_dssp             CSCCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTCSS-CEEEEEEEEEEE
T ss_pred             CCCCCccEEEEEECCCCcHHHHHHHHHHHhCCCEEEEEEeEeeeEEeEEEccCCCc-cEecCCcEEEEE
Confidence            4567899999999999999999999998777 799999999999999999998844 789999999987



>2p6y_A Hypothetical protein VCA0587; NESG, Q9KM02_vibch, VCR80, structural genomics, PSI-2, prote structure initiative; 1.63A {Vibrio cholerae} Back     alignment and structure
>2dt4_A Hypothetical protein PH0802; PPC domain, structural genomics, unknown function; 1.60A {Pyrococcus horikoshii} Back     alignment and structure
>2h6l_A Hypothetical protein; NESG GR103, structural genomics, PSI-2, protein structure initiative, northeast structural genomics consortium; 2.00A {Archaeoglobus fulgidus} SCOP: d.290.1.3 Back     alignment and structure
>3htn_A Putative DNA binding protein; DUF269 family protein, structural genomics, joint center for structural genomics, JCSG; HET: MSE 1PE; 1.50A {Bacteroides thetaiotaomicron vpi-5482} SCOP: d.290.1.0 Back     alignment and structure
>3hwu_A Putative DNA-binding protein; YP_299413.1, structural genomi center for structural genomics, JCSG, protein structure INI PSI-2; HET: MSE; 1.30A {Ralstonia eutropha} Back     alignment and structure
>2ezd_A High mobility group protein HMG-I/HMG-Y; DNA binding protein, minor groove DNA binding, transcriptional CO-activator, architectural factor; HET: DNA; NMR {Homo sapiens} SCOP: j.10.1.1 PDB: 2eze_A* Back     alignment and structure
>4i95_A Putative uncharacterized protein; lipocalin-like domain of PF13924 family, structural genomics center for structural genomics, JCSG; HET: MSE; 1.81A {Bacteroides eggerthii} PDB: 4gzv_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 217
d2hx0a1136 d.290.1.3 (A:6-141) Hypothetical protein STM3071 { 1e-06
d2h6la1138 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {A 4e-05
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Length = 136 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
 Score = 44.1 bits (104), Expect = 1e-06
 Identities = 13/70 (18%), Positives = 26/70 (37%), Gaps = 6/70 (8%)

Query: 147 SGSAGMGFTPHVITVAVGEDIAMKLLSFS-QQGPRAICVLSANGAISTATLRQPSSSGGS 205
           + S       + + +  G+++  +L +F  Q   RA  +    G+++   LR        
Sbjct: 2   NASTA---RFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQEA-- 56

Query: 206 VTYECPFLEV 215
            T      EV
Sbjct: 57  TTSLTGTFEV 66


>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Length = 138 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query217
d2hx0a1136 Hypothetical protein STM3071 {Salmonella typhimuri 99.26
d2h6la1138 Hypothetical protein AF0104 {Archaeoglobus fulgidu 98.94
>d2hx0a1 d.290.1.3 (A:6-141) Hypothetical protein STM3071 {Salmonella typhimurium [TaxId: 90371]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: AF0104/ALDC/Ptd012-like
superfamily: AF0104/ALDC/Ptd012-like
family: AF0104-like
domain: Hypothetical protein STM3071
species: Salmonella typhimurium [TaxId: 90371]
Probab=99.26  E-value=6.5e-12  Score=98.46  Aligned_cols=65  Identities=17%  Similarity=0.264  Sum_probs=60.5

Q ss_pred             CCCceeEEEEecCCccHHHHHHHHhhcCC-ceEEEecccCeeeeEEeeCCCCCCCceeEeeeeeeec
Q 027891          151 GMGFTPHVITVAVGEDIAMKLLSFSQQGP-RAICVLSANGAISTATLRQPSSSGGSVTYECPFLEVY  216 (217)
Q Consensus       151 g~~f~pHVI~V~~GEDV~~kI~~Faqq~~-raiCILSasGsVSnVTLRqp~ssg~tvtyeGrFEILS  216 (217)
                      ++..|-+++++.+||||.++|..||++.. ++.||+++.|++++|+|+++.. ....+++|+|||+|
T Consensus         3 ~~~~R~~~lrl~~Gedl~~~i~~~~~~~~I~~a~V~~~iGs~~~~~~~~~~~-~~~~~~~g~~Ei~s   68 (136)
T d2hx0a1           3 ASTARFYALRLLPGQEVFSQLHAFVQQNQLRAAWIAGCTGSLTDVALRYAGQ-EATTSLTGTFEVIS   68 (136)
T ss_dssp             CCCCEEEEEEECTTCBHHHHHHHHHHHHTCSSEEEEEEEEEEEEEEEECTTC-SSCEEEEEEEEEEE
T ss_pred             CCCCcEEEEEECCCChHHHHHHHHHHHhCCCEEEEEEEeeeeEEEEEEeCCC-CCcEEecCcEEEEE
Confidence            47789999999999999999999998888 8999999999999999999987 56789999999997



>d2h6la1 d.290.1.3 (A:1-138) Hypothetical protein AF0104 {Archaeoglobus fulgidus [TaxId: 2234]} Back     information, alignment and structure