Citrus Sinensis ID: 029494


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190--
MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSLGMPQGLSEGLK
cccccccHHcccccccccccccccccccccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccEEEEcccEEEEEEEEcccEEEEEEcHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccc
ccccHHHHHHHccccccccccccccccccccccccccccEccccccccccccccEEEEEEEcccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHccEEEcccccEEEEEEEcccccEEEEEcHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccc
MASTSAFTAQISsihwkkptsssnfsnnlnlnsnlagkemlsfssqqksghqfRSLRVYglfggkkdnnekgddapskagiLGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTlsgnqqpvRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQslgmpqglseglk
mastsaftaqissihwkkptsssnfsNNLNLNSNLAGKEMLSFssqqksghqfRSLRVYGLFggkkdnnekgddapskAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKvtlsgnqqpVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLaqslgmpqglseglk
MASTSAFTAQISSIHWKKPTsssnfsnnlnlnsnlAGKEMLSFSSQQKSGHQFRSLRVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSLGMPQGLSEGLK
*******************************************************LRVYGLF******************ILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAY*********************************
*********QISSIHWKKP****************AGKEM*****************VY**********************************************************YCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDA**********************QGLS****
***********SSIHWKKPTSSSNFSNNLNLNSNLAGKEMLS***********RSLRVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSL***********
***********SSI******SSSNFSNNLNLNSNLAGKEMLSFSSQ*K*GHQFRSLRVYGLFGGKK**********SKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKER**DL**SLGMPQ*******
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MASTSAFTAQISSIHWKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSLRVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSLGMPQGLSEGLK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query192 2.2.26 [Sep-21-2011]
Q8GMT0113 Nucleoid-associated prote no no 0.531 0.902 0.504 1e-17
Q5N376113 Nucleoid-associated prote yes no 0.531 0.902 0.504 1e-17
Q8YM73115 Nucleoid-associated prote yes no 0.531 0.886 0.504 4e-17
Q3MAP6115 Nucleoid-associated prote yes no 0.531 0.886 0.504 1e-16
B2J719115 Nucleoid-associated prote yes no 0.531 0.886 0.485 2e-15
B8HWY2114 Nucleoid-associated prote yes no 0.520 0.877 0.435 1e-13
Q8DKX6115 Nucleoid-associated prote yes no 0.520 0.869 0.425 5e-13
A2BTV8116 Nucleoid-associated prote yes no 0.526 0.870 0.401 2e-12
Q7V3Q0116 Nucleoid-associated prote yes no 0.526 0.870 0.401 3e-12
A9B9L9115 Nucleoid-associated prote yes no 0.526 0.878 0.411 4e-12
>sp|Q8GMT0|Y464_SYNE7 Nucleoid-associated protein Synpcc7942_0464 OS=Synechococcus elongatus (strain PCC 7942) GN=Synpcc7942_0464 PE=3 SV=2 Back     alignment and function desciption
 Score = 89.7 bits (221), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 52/103 (50%), Positives = 67/103 (65%), Gaps = 1/103 (0%)

Query: 82  LGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITE 141
           LG M+ L +  KKAQ V Q  A +VQ++L   E +G  +G L+KV +SGNQ+P+R EI  
Sbjct: 10  LGKMKELADAFKKAQQV-QEGAKQVQEDLNNMEIEGQAQGGLVKVWVSGNQEPLRAEIAP 68

Query: 142 AAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSLGMP 184
            A+  GAE LS LV  A KDA+QKS  AMKE+M  L   LG+P
Sbjct: 69  EALNEGAEVLSELVAAAMKDAYQKSTAAMKEKMEALTAGLGIP 111





Synechococcus elongatus (strain PCC 7942) (taxid: 1140)
>sp|Q5N376|Y1054_SYNP6 Nucleoid-associated protein syc1054_d OS=Synechococcus sp. (strain ATCC 27144 / PCC 6301 / SAUG 1402/1) GN=syc1054_d PE=3 SV=1 Back     alignment and function description
>sp|Q8YM73|Y5067_NOSS1 Nucleoid-associated protein alr5067 OS=Nostoc sp. (strain PCC 7120 / UTEX 2576) GN=alr5067 PE=3 SV=1 Back     alignment and function description
>sp|Q3MAP6|Y2322_ANAVT Nucleoid-associated protein Ava_2322 OS=Anabaena variabilis (strain ATCC 29413 / PCC 7937) GN=Ava_2322 PE=3 SV=1 Back     alignment and function description
>sp|B2J719|Y448_NOSP7 Nucleoid-associated protein Npun_F0448 OS=Nostoc punctiforme (strain ATCC 29133 / PCC 73102) GN=Npun_F0448 PE=3 SV=1 Back     alignment and function description
>sp|B8HWY2|Y899_CYAP4 Nucleoid-associated protein Cyan7425_0899 OS=Cyanothece sp. (strain PCC 7425 / ATCC 29141) GN=Cyan7425_0899 PE=3 SV=1 Back     alignment and function description
>sp|Q8DKX6|Y723_THEEB Nucleoid-associated protein tlr0723 OS=Thermosynechococcus elongatus (strain BP-1) GN=tlr0723 PE=3 SV=1 Back     alignment and function description
>sp|A2BTV8|Y019_PROM5 Nucleoid-associated protein P9515_00191 OS=Prochlorococcus marinus (strain MIT 9515) GN=P9515_00191 PE=3 SV=1 Back     alignment and function description
>sp|Q7V3Q0|Y020_PROMP Nucleoid-associated protein PMM0020 OS=Prochlorococcus marinus subsp. pastoris (strain CCMP1986 / MED4) GN=PMM0020 PE=3 SV=1 Back     alignment and function description
>sp|A9B9L9|Y020_PROM4 Nucleoid-associated protein P9211_00201 OS=Prochlorococcus marinus (strain MIT 9211) GN=P9211_00201 PE=3 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query192
224142377196 predicted protein [Populus trichocarpa] 1.0 0.979 0.775 3e-82
255577037195 conserved hypothetical protein [Ricinus 0.994 0.979 0.775 3e-78
225445842196 PREDICTED: UPF0133 protein Synpcc7942_04 1.0 0.979 0.765 1e-76
356543329196 PREDICTED: UPF0133 protein Synpcc7942_04 1.0 0.979 0.729 1e-73
351724105195 uncharacterized protein LOC100305795 [Gl 0.994 0.979 0.709 4e-70
449457887187 PREDICTED: nucleoid-associated protein S 0.963 0.989 0.737 1e-68
388500608188 unknown [Lotus japonicus] 0.979 1.0 0.692 1e-67
388507246188 unknown [Lotus japonicus] 0.979 1.0 0.687 4e-67
223948417186 unknown [Zea mays] gi|413926357|gb|AFW66 0.963 0.994 0.689 1e-66
413935880265 hypothetical protein ZEAMMB73_296576 [Ze 0.968 0.701 0.666 9e-66
>gi|224142377|ref|XP_002324535.1| predicted protein [Populus trichocarpa] gi|222865969|gb|EEF03100.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  309 bits (792), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 152/196 (77%), Positives = 173/196 (88%), Gaps = 4/196 (2%)

Query: 1   MASTSAFTAQISSIH----WKKPTSSSNFSNNLNLNSNLAGKEMLSFSSQQKSGHQFRSL 56
           MAST+A TAQ+++++    WK+P  +S    NLNLN+N+ GK++LS+S +QKS   +RS 
Sbjct: 1   MASTTAVTAQVTNLNGLGDWKRPFPASLPLGNLNLNANVVGKQLLSWSGKQKSERNYRSF 60

Query: 57  RVYGLFGGKKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFD 116
           RV GLFGGKK+N+EK DDAPSKAGI GNMQNLYETVKKAQMVVQVEAVRVQKELA+AEFD
Sbjct: 61  RVIGLFGGKKENSEKSDDAPSKAGIFGNMQNLYETVKKAQMVVQVEAVRVQKELASAEFD 120

Query: 117 GYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSD 176
           GYCEGELIKVTL+GNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSV AMKERM+D
Sbjct: 121 GYCEGELIKVTLTGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVQAMKERMND 180

Query: 177 LAQSLGMPQGLSEGLK 192
           LAQSLGMP G+ EGLK
Sbjct: 181 LAQSLGMPPGVGEGLK 196




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255577037|ref|XP_002529403.1| conserved hypothetical protein [Ricinus communis] gi|223531151|gb|EEF32999.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225445842|ref|XP_002277907.1| PREDICTED: UPF0133 protein Synpcc7942_0464 [Vitis vinifera] gi|297743656|emb|CBI36539.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356543329|ref|XP_003540114.1| PREDICTED: UPF0133 protein Synpcc7942_0464-like [Glycine max] Back     alignment and taxonomy information
>gi|351724105|ref|NP_001236790.1| uncharacterized protein LOC100305795 [Glycine max] gi|255626621|gb|ACU13655.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|449457887|ref|XP_004146679.1| PREDICTED: nucleoid-associated protein Synpcc7942_0464-like [Cucumis sativus] gi|449503169|ref|XP_004161868.1| PREDICTED: nucleoid-associated protein Synpcc7942_0464-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|388500608|gb|AFK38370.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|388507246|gb|AFK41689.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|223948417|gb|ACN28292.1| unknown [Zea mays] gi|413926357|gb|AFW66289.1| hypothetical protein ZEAMMB73_706905 [Zea mays] Back     alignment and taxonomy information
>gi|413935880|gb|AFW70431.1| hypothetical protein ZEAMMB73_296576 [Zea mays] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query192
TAIR|locus:2061471182 AT2G24020 "AT2G24020" [Arabido 0.776 0.818 0.802 6.6e-58
TAIR|locus:2118721180 AT4G30620 "AT4G30620" [Arabido 0.796 0.85 0.778 1.1e-57
UNIPROTKB|Q9KT50109 VC_1055 "Nucleoid-associated p 0.526 0.926 0.323 1.2e-08
TIGR_CMR|VC_1055109 VC_1055 "conserved hypothetica 0.526 0.926 0.323 1.2e-08
UNIPROTKB|P0A8B5109 ybaB [Escherichia coli K-12 (t 0.520 0.917 0.317 6.8e-08
UNIPROTKB|Q609A8107 MCA1327 "Nucleoid-associated p 0.520 0.934 0.269 2.3e-07
UNIPROTKB|Q47XR0108 CPS_3743 "Nucleoid-associated 0.526 0.935 0.285 7.8e-07
TIGR_CMR|CPS_3743108 CPS_3743 "conserved hypothetic 0.526 0.935 0.285 7.8e-07
UNIPROTKB|Q8EFF9109 SO_2014 "Nucleoid-associated p 0.526 0.926 0.314 1e-06
TIGR_CMR|SO_2014109 SO_2014 "conserved hypothetica 0.526 0.926 0.314 1e-06
TAIR|locus:2061471 AT2G24020 "AT2G24020" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 595 (214.5 bits), Expect = 6.6e-58, P = 6.6e-58
 Identities = 122/152 (80%), Positives = 134/152 (88%)

Query:    42 SFSSQQKSGHQFRSLRVYGLFGG-KKDNNEKGDDAPSKAGILGNMQNLYETVKKAQMVVQ 100
             ++  Q KS + +RSLRV GLFGG  KDNN   +D  SKAGI GNMQN+YETVKKAQMVVQ
Sbjct:    33 TWPKQYKSKNGYRSLRVNGLFGGGNKDNNS--EDGQSKAGIFGNMQNMYETVKKAQMVVQ 90

Query:   101 VEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYK 160
             VEAVRVQKELAAAEFDGYC GEL+KVTLSGNQQP+RT+ITEAAMELG+EKLS LVTEAYK
Sbjct:    91 VEAVRVQKELAAAEFDGYCAGELVKVTLSGNQQPIRTDITEAAMELGSEKLSQLVTEAYK 150

Query:   161 DAHQKSVLAMKERMSDLAQSLGMPQGLSEGLK 192
             DAH KSV+AMKERMSDLAQSLGMP GLSEG+K
Sbjct:   151 DAHAKSVVAMKERMSDLAQSLGMPPGLSEGMK 182




GO:0008150 "biological_process" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0009941 "chloroplast envelope" evidence=IDA
GO:0009535 "chloroplast thylakoid membrane" evidence=IDA
GO:0009570 "chloroplast stroma" evidence=IDA
GO:0019243 "methylglyoxal catabolic process to D-lactate" evidence=RCA
TAIR|locus:2118721 AT4G30620 "AT4G30620" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q9KT50 VC_1055 "Nucleoid-associated protein VC_1055" [Vibrio cholerae O1 biovar El Tor str. N16961 (taxid:243277)] Back     alignment and assigned GO terms
TIGR_CMR|VC_1055 VC_1055 "conserved hypothetical protein" [Vibrio cholerae O1 biovar El Tor (taxid:686)] Back     alignment and assigned GO terms
UNIPROTKB|P0A8B5 ybaB [Escherichia coli K-12 (taxid:83333)] Back     alignment and assigned GO terms
UNIPROTKB|Q609A8 MCA1327 "Nucleoid-associated protein MCA1327" [Methylococcus capsulatus str. Bath (taxid:243233)] Back     alignment and assigned GO terms
UNIPROTKB|Q47XR0 CPS_3743 "Nucleoid-associated protein CPS_3743" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
TIGR_CMR|CPS_3743 CPS_3743 "conserved hypothetical protein TIGR00103" [Colwellia psychrerythraea 34H (taxid:167879)] Back     alignment and assigned GO terms
UNIPROTKB|Q8EFF9 SO_2014 "Nucleoid-associated protein SO_2014" [Shewanella oneidensis MR-1 (taxid:211586)] Back     alignment and assigned GO terms
TIGR_CMR|SO_2014 SO_2014 "conserved hypothetical protein TIGR00103" [Shewanella oneidensis MR-1 (taxid:211586)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
estExt_fgenesh4_pg.C_LG_XVIII0915
hypothetical protein (196 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query192
pfam0257592 pfam02575, YbaB_DNA_bd, YbaB/EbfC DNA-binding fami 8e-20
PRK00153104 PRK00153, PRK00153, hypothetical protein; Validate 6e-19
COG0718105 COG0718, COG0718, Uncharacterized protein conserve 3e-12
TIGR00103101 TIGR00103, DNA_YbaB_EbfC, DNA-binding protein, Yba 2e-09
PRK14625109 PRK14625, PRK14625, hypothetical protein; Provisio 3e-04
>gnl|CDD|217115 pfam02575, YbaB_DNA_bd, YbaB/EbfC DNA-binding family Back     alignment and domain information
 Score = 79.1 bits (196), Expect = 8e-20
 Identities = 31/92 (33%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 93  KKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLS 152
           K+AQ + Q +  + Q+ELA  E +G   G L+KVT++G  +    EI    ++   E L 
Sbjct: 2   KQAQQM-QEKMEKAQEELANKEVEGTSGGGLVKVTVNGKGEVKDVEIDPELLDEDKEMLE 60

Query: 153 LLVTEAYKDAHQKSVLAMKERMSDLAQSLGMP 184
            L+  A  DA +K+    KE M  L   L +P
Sbjct: 61  DLIVAAINDALKKAEELAKEEMGKLTGGLNLP 92


This is a family of DNA-binding proteins. Members of this family form homodimers which bind DNA via a tweezer-like structure. The conformation of the DNA is changed when bound to these proteins. In bacteria, these proteins may play a role in DNA replication-recovery following DNA damage. Length = 92

>gnl|CDD|234669 PRK00153, PRK00153, hypothetical protein; Validated Back     alignment and domain information
>gnl|CDD|223790 COG0718, COG0718, Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>gnl|CDD|211546 TIGR00103, DNA_YbaB_EbfC, DNA-binding protein, YbaB/EbfC family Back     alignment and domain information
>gnl|CDD|184772 PRK14625, PRK14625, hypothetical protein; Provisional Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 192
PRK14624115 hypothetical protein; Provisional 100.0
PRK14625109 hypothetical protein; Provisional 100.0
PRK14622103 hypothetical protein; Provisional 100.0
PRK14621111 hypothetical protein; Provisional 99.98
PRK14626110 hypothetical protein; Provisional 99.98
PRK00153104 hypothetical protein; Validated 99.97
PRK14628118 hypothetical protein; Provisional 99.97
PRK14623106 hypothetical protein; Provisional 99.97
COG0718105 Uncharacterized protein conserved in bacteria [Fun 99.97
PRK14627100 hypothetical protein; Provisional 99.97
TIGR00103102 DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC famil 99.97
PRK03762103 hypothetical protein; Provisional 99.96
PRK0058799 hypothetical protein; Provisional 99.96
PRK1462999 hypothetical protein; Provisional 99.95
PF0257593 YbaB_DNA_bd: YbaB/EbfC DNA-binding family; InterPr 99.94
PF10921109 DUF2710: Protein of unknown function (DUF2710); In 96.73
PRK14624115 hypothetical protein; Provisional 96.28
PRK03762103 hypothetical protein; Provisional 96.26
PRK14626110 hypothetical protein; Provisional 95.91
TIGR00103102 DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC famil 95.49
COG0718105 Uncharacterized protein conserved in bacteria [Fun 95.21
PRK00153104 hypothetical protein; Validated 94.47
PRK14628118 hypothetical protein; Provisional 94.23
PRK14627100 hypothetical protein; Provisional 93.9
PF10904101 DUF2694: Protein of unknown function (DUF2694); In 93.45
PRK14625109 hypothetical protein; Provisional 93.43
PRK14623106 hypothetical protein; Provisional 93.14
PRK14621111 hypothetical protein; Provisional 93.09
PF0257593 YbaB_DNA_bd: YbaB/EbfC DNA-binding family; InterPr 91.21
PRK14622103 hypothetical protein; Provisional 91.13
PRK0058799 hypothetical protein; Provisional 86.37
PF1043786 Lip_prot_lig_C: Bacterial lipoate protein ligase C 82.97
PF1310385 TonB_2: TonB C terminal; PDB: 1LR0_A. 80.97
>PRK14624 hypothetical protein; Provisional Back     alignment and domain information
Probab=100.00  E-value=8e-34  Score=221.84  Aligned_cols=106  Identities=20%  Similarity=0.264  Sum_probs=102.2

Q ss_pred             ccccHHHHHHHHHHHHHHHHHHHHHHHHHHhccEEEEEECCCeEEEEEecCcceEEEEeCcccccC-CHHHHHHHHHHHH
Q 029494           81 ILGNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMEL-GAEKLSLLVTEAY  159 (192)
Q Consensus        81 m~gnm~~L~~~~KkaQe~mQ~km~klQeeL~~~~vtgsSggGlVkVtvnG~gev~~V~Idp~~l~~-D~E~LedLI~aAv  159 (192)
                      ||.+|++|.+++||||+ ||++++++|++|++++|+|+|+||+|+||+||+++|++|+|||+++++ |+|+|||||++|+
T Consensus         1 ~~~~~~nm~~~mkqAq~-mQ~km~~~QeeL~~~~v~g~sGgG~VkV~~nG~~~i~~i~Idp~lld~eD~E~LeDLI~aAv   79 (115)
T PRK14624          1 MFDKIKNMSEALSNMGN-IREKMEEVKKRIASIRVVGDAGAGMVTVTATGEGQITNVFINKQLFDADDNKMLEDLVMAAT   79 (115)
T ss_pred             CcchHHhHHHHHHHHHH-HHHHHHHHHHHHhccEEEEEECCcEEEEEEEcCccEEEEEECHHHcCcccHHHHHHHHHHHH
Confidence            67788888899999999 899999999999999999999999999999999999999999999986 9999999999999


Q ss_pred             HHHHHHHHHHHHHHHHhhhcCCCCCCCCC
Q 029494          160 KDAHQKSVLAMKERMSDLAQSLGMPQGLS  188 (192)
Q Consensus       160 NdA~~Ka~e~~~e~m~~ltGGl~lP~Gl~  188 (192)
                      |+|++++++..+++|++++|||+|| ||+
T Consensus        80 NdA~~k~~e~~~e~m~~~tgGm~lP-Gl~  107 (115)
T PRK14624         80 NDALKKAKEATAYEFQNASGGLDFS-EIS  107 (115)
T ss_pred             HHHHHHHHHHHHHHHHHHhCCCCCC-chH
Confidence            9999999999999999999999999 864



>PRK14625 hypothetical protein; Provisional Back     alignment and domain information
>PRK14622 hypothetical protein; Provisional Back     alignment and domain information
>PRK14621 hypothetical protein; Provisional Back     alignment and domain information
>PRK14626 hypothetical protein; Provisional Back     alignment and domain information
>PRK00153 hypothetical protein; Validated Back     alignment and domain information
>PRK14628 hypothetical protein; Provisional Back     alignment and domain information
>PRK14623 hypothetical protein; Provisional Back     alignment and domain information
>COG0718 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PRK14627 hypothetical protein; Provisional Back     alignment and domain information
>TIGR00103 DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC family Back     alignment and domain information
>PRK03762 hypothetical protein; Provisional Back     alignment and domain information
>PRK00587 hypothetical protein; Provisional Back     alignment and domain information
>PRK14629 hypothetical protein; Provisional Back     alignment and domain information
>PF02575 YbaB_DNA_bd: YbaB/EbfC DNA-binding family; InterPro: IPR004401 The function of this protein is unknown Back     alignment and domain information
>PF10921 DUF2710: Protein of unknown function (DUF2710); InterPro: IPR024296 This family of uncharacterised proteins appears to be restricted to Mycobacteriaceae Back     alignment and domain information
>PRK14624 hypothetical protein; Provisional Back     alignment and domain information
>PRK03762 hypothetical protein; Provisional Back     alignment and domain information
>PRK14626 hypothetical protein; Provisional Back     alignment and domain information
>TIGR00103 DNA_YbaB_EbfC DNA-binding protein, YbaB/EbfC family Back     alignment and domain information
>COG0718 Uncharacterized protein conserved in bacteria [Function unknown] Back     alignment and domain information
>PRK00153 hypothetical protein; Validated Back     alignment and domain information
>PRK14628 hypothetical protein; Provisional Back     alignment and domain information
>PRK14627 hypothetical protein; Provisional Back     alignment and domain information
>PF10904 DUF2694: Protein of unknown function (DUF2694); InterPro: IPR024426 This family of proteins with unknown function appears to be restricted to Mycobacterium spp Back     alignment and domain information
>PRK14625 hypothetical protein; Provisional Back     alignment and domain information
>PRK14623 hypothetical protein; Provisional Back     alignment and domain information
>PRK14621 hypothetical protein; Provisional Back     alignment and domain information
>PF02575 YbaB_DNA_bd: YbaB/EbfC DNA-binding family; InterPro: IPR004401 The function of this protein is unknown Back     alignment and domain information
>PRK14622 hypothetical protein; Provisional Back     alignment and domain information
>PRK00587 hypothetical protein; Provisional Back     alignment and domain information
>PF10437 Lip_prot_lig_C: Bacterial lipoate protein ligase C-terminus; InterPro: IPR019491 This is the C-terminal domain of a bacterial lipoate protein ligase Back     alignment and domain information
>PF13103 TonB_2: TonB C terminal; PDB: 1LR0_A Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query192
1pug_A109 Structure Of E. Coli Ybab Length = 109 2e-05
>pdb|1PUG|A Chain A, Structure Of E. Coli Ybab Length = 109 Back     alignment and structure

Iteration: 1

Score = 45.8 bits (107), Expect = 2e-05, Method: Compositional matrix adjust. Identities = 31/105 (29%), Positives = 54/105 (51%), Gaps = 4/105 (3%) Query: 83 GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEA 142 G + NL + ++ Q +Q ++Q+E+A E G L+KVT++G R EI + Sbjct: 5 GGLGNLMKQAQQMQEKMQ----KMQEEIAQLEVTGESGAGLVKVTINGAHNCRRVEIDPS 60 Query: 143 AMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSLGMPQGL 187 +E E L LV A+ DA ++ KE+M+ ++ + +P G Sbjct: 61 LLEDDKEMLEDLVAAAFNDAARRIEETQKEKMASVSSGMQLPPGF 105

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query192
1j8b_A112 YBAB; hypothetical protein, structural genomics, s 2e-13
1ybx_A143 Conserved hypothetical protein; ST genomics, PSI, 6e-13
3f42_A99 Protein HP0035; helicobacter pylori unknown-functi 1e-09
>1j8b_A YBAB; hypothetical protein, structural genomics, structure function project, S2F, unknown function; HET: MSE; 1.75A {Haemophilus influenzae RD} SCOP: d.222.1.1 PDB: 1pug_A Length = 112 Back     alignment and structure
 Score = 63.1 bits (154), Expect = 2e-13
 Identities = 30/105 (28%), Positives = 56/105 (53%), Gaps = 4/105 (3%)

Query: 83  GNMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEA 142
           G +  L   +K+AQ + Q +  ++Q+E+A  E  G     L+K+T++G     R +I  +
Sbjct: 8   GGLGGL---MKQAQQM-QEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPS 63

Query: 143 AMELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQSLGMPQGL 187
            ME   E L  L+  A+ DA +++    KE+M+ +   + +P G+
Sbjct: 64  LMEDDKEMLEDLIAAAFNDAVRRAEELQKEKMASVTAGMPLPPGM 108


>1ybx_A Conserved hypothetical protein; ST genomics, PSI, protein structure initiative, southeast COLL for structural genomics, secsg; HET: MSE; 1.80A {Clostridium thermocellum} Length = 143 Back     alignment and structure
>3f42_A Protein HP0035; helicobacter pylori unknown-function, structural genomics, P protein structure initiative; HET: MSE; 1.78A {Helicobacter pylori} Length = 99 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query192
1j8b_A112 YBAB; hypothetical protein, structural genomics, s 100.0
1ybx_A143 Conserved hypothetical protein; ST genomics, PSI, 100.0
3f42_A99 Protein HP0035; helicobacter pylori unknown-functi 99.96
1ybx_A143 Conserved hypothetical protein; ST genomics, PSI, 96.95
1j8b_A112 YBAB; hypothetical protein, structural genomics, s 94.74
3f42_A99 Protein HP0035; helicobacter pylori unknown-functi 82.85
>1j8b_A YBAB; hypothetical protein, structural genomics, structure function project, S2F, unknown function; HET: MSE; 1.75A {Haemophilus influenzae RD} SCOP: d.222.1.1 PDB: 1pug_A Back     alignment and structure
Probab=100.00  E-value=3.4e-35  Score=226.96  Aligned_cols=104  Identities=28%  Similarity=0.426  Sum_probs=88.4

Q ss_pred             cHHHHHHHHHHHHHHHHHHHHHHHHHHhccEEEEEECCCeEEEEEecCcceEEEEeCcccccCCHHHHHHHHHHHHHHHH
Q 029494           84 NMQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAH  163 (192)
Q Consensus        84 nm~~L~~~~KkaQe~mQ~km~klQeeL~~~~vtgsSggGlVkVtvnG~gev~~V~Idp~~l~~D~E~LedLI~aAvNdA~  163 (192)
                      ||++|+   +|||+ ||++++++|+||++++|+|+|+||+|+||+||+++|++|+|||+++++|+|+|||||++|+|+|+
T Consensus         9 nm~~mm---kqaq~-mQ~~m~~~QeeL~~~~v~g~sggG~V~Vt~~G~~ev~~i~Idp~~~~~d~E~LedlI~aA~ndA~   84 (112)
T 1j8b_A            9 GLGGLM---KQAQQ-MQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLMEDDKEMLEDLIAAAFNDAV   84 (112)
T ss_dssp             -CCCHH---HHHHH-HHHHHHHHHHHHTTSEEEEEEGGGTEEEEEETTCCEEEEEECGGGGGSCHHHHHHHHHHHHHHHH
T ss_pred             CHHHHH---HHHHH-HHHHHHHHHHHHhccEEEEEECCCEEEEEEecCceEEEEEECHHHHhCCHHHHHHHHHHHHHHHH
Confidence            566665   77787 79999999999999999999999999999999999999999999998899999999999999999


Q ss_pred             HHHHHHHHHHHHhhhcCCCCCCCCCCCC
Q 029494          164 QKSVLAMKERMSDLAQSLGMPQGLSEGL  191 (192)
Q Consensus       164 ~Ka~e~~~e~m~~ltGGl~lP~Gl~~g~  191 (192)
                      +++++.++++|++++|||+|||||+.++
T Consensus        85 ~ka~~~~~e~m~~~tgGl~lppG~~~pf  112 (112)
T 1j8b_A           85 RRAEELQKEKMASVTAGMPLPPGMKFPF  112 (112)
T ss_dssp             HHHHHHHHHHHHHHTCC-----------
T ss_pred             HHHHHHHHHHHHHHhCCCCCCCCCCCCC
Confidence            9999999999999999999988997653



>1ybx_A Conserved hypothetical protein; ST genomics, PSI, protein structure initiative, southeast COLL for structural genomics, secsg; HET: MSE; 1.80A {Clostridium thermocellum} Back     alignment and structure
>3f42_A Protein HP0035; helicobacter pylori unknown-function, structural genomics, P protein structure initiative; HET: MSE; 1.78A {Helicobacter pylori} Back     alignment and structure
>1ybx_A Conserved hypothetical protein; ST genomics, PSI, protein structure initiative, southeast COLL for structural genomics, secsg; HET: MSE; 1.80A {Clostridium thermocellum} Back     alignment and structure
>1j8b_A YBAB; hypothetical protein, structural genomics, structure function project, S2F, unknown function; HET: MSE; 1.75A {Haemophilus influenzae RD} SCOP: d.222.1.1 PDB: 1pug_A Back     alignment and structure
>3f42_A Protein HP0035; helicobacter pylori unknown-function, structural genomics, P protein structure initiative; HET: MSE; 1.78A {Helicobacter pylori} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 192
d1j8ba_92 d.222.1.1 (A:) Hypothetical protein HI0442 {Haemop 3e-14
>d1j8ba_ d.222.1.1 (A:) Hypothetical protein HI0442 {Haemophilus influenzae [TaxId: 727]} Length = 92 Back     information, alignment and structure

class: Alpha and beta proteins (a+b)
fold: YbaB-like
superfamily: YbaB-like
family: YbaB-like
domain: Hypothetical protein HI0442
species: Haemophilus influenzae [TaxId: 727]
 Score = 63.1 bits (154), Expect = 3e-14
 Identities = 27/95 (28%), Positives = 50/95 (52%), Gaps = 4/95 (4%)

Query: 85  MQNLYETVKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAM 144
           +  L   +K+AQ + Q +  ++Q+E+A  E  G     L+K+T++G     R +I  + M
Sbjct: 1   LGGL---MKQAQQM-QEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLM 56

Query: 145 ELGAEKLSLLVTEAYKDAHQKSVLAMKERMSDLAQ 179
           E   E L  L+  A+ DA +++    KE+M+ +  
Sbjct: 57  EDDKEMLEDLIAAAFNDAVRRAEELQKEKMASVTA 91


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query192
d1j8ba_92 Hypothetical protein HI0442 {Haemophilus influenza 99.95
d1j8ba_92 Hypothetical protein HI0442 {Haemophilus influenza 93.19
>d1j8ba_ d.222.1.1 (A:) Hypothetical protein HI0442 {Haemophilus influenzae [TaxId: 727]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: YbaB-like
superfamily: YbaB-like
family: YbaB-like
domain: Hypothetical protein HI0442
species: Haemophilus influenzae [TaxId: 727]
Probab=99.95  E-value=1.7e-28  Score=181.20  Aligned_cols=88  Identities=30%  Similarity=0.419  Sum_probs=85.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHhccEEEEEECCCeEEEEEecCcceEEEEeCcccccCCHHHHHHHHHHHHHHHHHHHHHHHH
Q 029494           92 VKKAQMVVQVEAVRVQKELAAAEFDGYCEGELIKVTLSGNQQPVRTEITEAAMELGAEKLSLLVTEAYKDAHQKSVLAMK  171 (192)
Q Consensus        92 ~KkaQe~mQ~km~klQeeL~~~~vtgsSggGlVkVtvnG~gev~~V~Idp~~l~~D~E~LedLI~aAvNdA~~Ka~e~~~  171 (192)
                      |+|||+ ||++++++|++|++++|+|++++|+|+||+||+++|++|+|||+++++|+|+|+|||++|||+|++++++..+
T Consensus         5 mkqaq~-mq~~m~~~q~eL~~~~v~~~sg~g~V~V~~~G~~~v~~i~Id~~~l~~d~e~LedlI~~A~N~A~~k~~~~~~   83 (92)
T d1j8ba_           5 MKQAQQ-MQEKMQKMQEEIAQLEVTGESGAGLVKITINGAHNCRRIDIDPSLMEDDKEMLEDLIAAAFNDAVRRAEELQK   83 (92)
T ss_dssp             HHHHHH-HHHHHHHHHHHHTTSEEEEEEGGGTEEEEEETTCCEEEEEECGGGGGSCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHH-HHHHHHHHHHHHhCeEEEEEECCCEEEEEEECCeEEEEEEECHHHhcCcHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            488888 7999999999999999999999999999999999999999999999889999999999999999999999999


Q ss_pred             HHHHhhhcC
Q 029494          172 ERMSDLAQS  180 (192)
Q Consensus       172 e~m~~ltGG  180 (192)
                      ++|++++||
T Consensus        84 ~~m~~~tGG   92 (92)
T d1j8ba_          84 EKMASVTAG   92 (92)
T ss_dssp             HHHHHHTCC
T ss_pred             HHHHhhcCC
Confidence            999999987



>d1j8ba_ d.222.1.1 (A:) Hypothetical protein HI0442 {Haemophilus influenzae [TaxId: 727]} Back     information, alignment and structure