Citrus Sinensis ID: 022217


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-
MEAGTIFYSAIPWKPKQPFPRNPSSFPRFIRASASSQSQGGYRGPKPSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVPSMVQEGI
ccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccEEEcccccHHHHHHHHHHHHHHHHHccccEEEEEEccEEEEEEEEEccccccccHHHHHHHcccccHHHHHHHHcccccccccccccccccccccccccccEEEEEEccccEEEEEEccccccccHHHHHHHHHHHHHHHHHHHHHcccccccccccc
cccccHHccccccccccccccccccccccccccccccccccccccccccHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccHcccccccccccccEEEEEcccccHHHHHHHHHHHHHHHHccccEEEEEEEcccEEEEEEEcccccccccccccccccccccHHHHHHHHccccEEEEEccccccccccccccccccEEEEEEccccEEEEEEccccccccHHHHHHHHHHHHHHHHHHHHHHcccccHHcccc
MEAGTIFYsaipwkpkqpfprnpssfprfirasassqsqggyrgpkpsqdlVADWVMnnddavrslpiyvGGASLLAVLFNRtvsgiapvadasssqSRADLLAISLAVTSILTGLVWlsirpksitvnsfslfqvnpkgvecqmicsylpdsVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMaaesgnsgeAVIVDASKLKQGSVYLGVMRSKAQRYLANLLlypgrselpflpsntqavilqplgdkgiaiiggdtirgfttsdQSWIAFIGEKLDATLAKYSNIVPSMVQEGI
meagtifysaipwkpkqpFPRNPSSFPRFIRASassqsqggyrgpKPSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVPSMVQEGI
MEAGTIFYSAIPWKPKQpfprnpssfprfIRASASSQSQGGYRGPKPSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVPSMVQEGI
*****IFYSAIPW*************************************LVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVA********ADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIV********
********SAIPWKPKQPF**********************************DWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVA***SSQSRADLLAISLAVTSILTGLVWLSI*******************VECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQ*S*********AQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKY************
MEAGTIFYSAIPWKPKQPFPRNPSSFPRFIRA*************KPSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVA********ADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVPSMVQEGI
****TIFYSAIPWKPKQPFPRNPSSF********************PSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKSIT*NSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVP*******
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHHHHHHoooooooooHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHooooooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEAGTIFYSAIPWKPKQPFPRNPSSFPRFIRASASSQSQGGYRGPKPSQDLVADWVMNNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVPSMVQEGI
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query301
255558330336 conserved hypothetical protein [Ricinus 0.976 0.875 0.702 1e-113
42562815297 cofactor assembly of complex C [Arabidop 0.936 0.949 0.715 1e-110
297840573298 hypothetical protein ARALYDRAFT_475325 [ 0.936 0.946 0.709 1e-110
449434863300 PREDICTED: uncharacterized protein LOC10 0.933 0.936 0.713 1e-109
449523652300 PREDICTED: uncharacterized LOC101208049 0.933 0.936 0.713 1e-108
5080827305 Hypothetical protein [Arabidopsis thalia 0.936 0.924 0.693 1e-108
225424815291 PREDICTED: uncharacterized protein LOC10 0.963 0.996 0.695 1e-101
356501956299 PREDICTED: uncharacterized protein LOC10 0.963 0.969 0.623 3e-95
51971242242 hypothetical protein [Arabidopsis thalia 0.787 0.979 0.696 4e-87
218195714293 hypothetical protein OsI_17698 [Oryza sa 0.827 0.849 0.636 1e-84
>gi|255558330|ref|XP_002520192.1| conserved hypothetical protein [Ricinus communis] gi|223540684|gb|EEF42247.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  413 bits (1061), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 213/303 (70%), Positives = 242/303 (79%), Gaps = 9/303 (2%)

Query: 1   MEAGTIFYSAIPWKPKQPFP--RNPSSFPRFIRASASSQSQGGYRGPKPSQDLVADWVMN 58
           MEAGTI  ++I ++    FP  R   +   F+RAS+   SQG YRGPKP +DLVADWV N
Sbjct: 41  MEAGTILPTSILFRSPFHFPISRPLQNLRFFVRASSDLNSQGKYRGPKPKRDLVADWVSN 100

Query: 59  NDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVW 118
           NDD VRSLPIYVGGASLLAVLFNR  SGIAPVADASSSQSRADLL + LAVT+IL GL+W
Sbjct: 101 NDDTVRSLPIYVGGASLLAVLFNRAASGIAPVADASSSQSRADLLTLGLAVTNILAGLIW 160

Query: 119 LSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVY 178
           LSI+PKSI++       VNP+GVECQ+I S+LPD VVSELLWAWESLSA TCCRSLVVVY
Sbjct: 161 LSIKPKSISL-------VNPQGVECQIILSHLPDYVVSELLWAWESLSAATCCRSLVVVY 213

Query: 179 DGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPF 238
           D +  LQIGMAAES N GEA+ VDA+KL QGS+   + +S AQ YLANL LYPGR+ELPF
Sbjct: 214 DCVCFLQIGMAAESPNKGEALSVDAAKLMQGSLVQAIKKSGAQSYLANLSLYPGRTELPF 273

Query: 239 LPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKYSNIVPSMVQ 298
           LP NTQAVILQPLGDKG+AIIGGDTIRGFTTSDQ+WI FIGEKLD+TLAK     P  +Q
Sbjct: 274 LPLNTQAVILQPLGDKGVAIIGGDTIRGFTTSDQAWITFIGEKLDSTLAKCVISSPLAMQ 333

Query: 299 EGI 301
           + I
Sbjct: 334 DRI 336




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|42562815|ref|NP_176193.2| cofactor assembly of complex C [Arabidopsis thaliana] gi|42571921|ref|NP_974051.1| cofactor assembly of complex C [Arabidopsis thaliana] gi|34146872|gb|AAQ62444.1| At1g59840 [Arabidopsis thaliana] gi|51968666|dbj|BAD43025.1| hypothetical protein [Arabidopsis thaliana] gi|51968878|dbj|BAD43131.1| hypothetical protein [Arabidopsis thaliana] gi|62320426|dbj|BAD94887.1| hypothetical protein [Arabidopsis thaliana] gi|332195505|gb|AEE33626.1| cofactor assembly of complex C [Arabidopsis thaliana] gi|332195506|gb|AEE33627.1| cofactor assembly of complex C [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297840573|ref|XP_002888168.1| hypothetical protein ARALYDRAFT_475325 [Arabidopsis lyrata subsp. lyrata] gi|297334009|gb|EFH64427.1| hypothetical protein ARALYDRAFT_475325 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|449434863|ref|XP_004135215.1| PREDICTED: uncharacterized protein LOC101208049 [Cucumis sativus] Back     alignment and taxonomy information
>gi|449523652|ref|XP_004168837.1| PREDICTED: uncharacterized LOC101208049 [Cucumis sativus] Back     alignment and taxonomy information
>gi|5080827|gb|AAD39336.1|AC007258_25 Hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|225424815|ref|XP_002267988.1| PREDICTED: uncharacterized protein LOC100255235 [Vitis vinifera] gi|296086476|emb|CBI32065.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|356501956|ref|XP_003519789.1| PREDICTED: uncharacterized protein LOC100776666 [Glycine max] Back     alignment and taxonomy information
>gi|51971242|dbj|BAD44313.1| hypothetical protein [Arabidopsis thaliana] gi|51971329|dbj|BAD44329.1| hypothetical protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|218195714|gb|EEC78141.1| hypothetical protein OsI_17698 [Oryza sativa Indica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query301
TAIR|locus:2025896297 CCB4 "AT1G59840" [Arabidopsis 0.936 0.949 0.695 9.4e-98
TAIR|locus:2025896 CCB4 "AT1G59840" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 971 (346.9 bits), Expect = 9.4e-98, P = 9.4e-98
 Identities = 203/292 (69%), Positives = 229/292 (78%)

Query:     1 MEAGTIFYSA-IPWKPKQXXXXXXXXXXXXIRASASSQSQGG--YRGPKPSQDLVADWVM 57
             MEA  I     IPW   +            IRAS+SS SQ    Y GPKP ++LVAD++ 
Sbjct:     1 MEARIILLRIQIPWSANRQFSHPPLDFPRFIRASSSSTSQKPKTYEGPKPRKNLVADFIS 60

Query:    58 NNDDAVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLV 117
              NDD VRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLA+ LAVT++LTGLV
Sbjct:    61 KNDDLVRSLPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLALGLAVTNLLTGLV 120

Query:   118 WLSIRPKSITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVV 177
             WLSIRPKSIT        VNPKGVEC+++ S LP S+VSELLWAWESL   TCC+SLV+V
Sbjct:   121 WLSIRPKSIT-------PVNPKGVECKVVESDLPASMVSELLWAWESLKVATCCKSLVIV 173

Query:   178 YDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELP 237
             Y+GI L+QIGM AES    + VIV   KL QGSVY GVM+SKAQ YLANL LYPGRSELP
Sbjct:   174 YNGICLIQIGMVAESPEDKKTVIVKTDKLMQGSVYRGVMKSKAQSYLANLSLYPGRSELP 233

Query:   238 FLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAKY 289
             FLP+NTQAVILQPLGDKGIA+IGG+TIRGFT+SDQ+WI+ IGEKLDATL +Y
Sbjct:   234 FLPANTQAVILQPLGDKGIAVIGGNTIRGFTSSDQAWISSIGEKLDATLGRY 285


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.133   0.388    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      301       289   0.00088  115 3  11 22  0.38    34
                                                     33  0.43    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  607 (65 KB)
  Total size of DFA:  196 KB (2110 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  20.56u 0.09s 20.65t   Elapsed:  00:00:01
  Total cpu time:  20.56u 0.09s 20.65t   Elapsed:  00:00:01
  Start:  Sat May 11 10:59:07 2013   End:  Sat May 11 10:59:08 2013


GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0010190 "cytochrome b6f complex assembly" evidence=IMP
GO:0010207 "photosystem II assembly" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query301
pfam11152195 pfam11152, DUF2930, Protein of unknown function (D 8e-66
>gnl|CDD|221004 pfam11152, DUF2930, Protein of unknown function (DUF2930) Back     alignment and domain information
 Score =  204 bits (521), Expect = 8e-66
 Identities = 77/218 (35%), Positives = 113/218 (51%), Gaps = 23/218 (10%)

Query: 66  LPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKS 125
           LP+  G    L +L NR  +     AD + SQSRAD+L + LA   IL GL+W   +P+S
Sbjct: 1   LPLIAGLLGGLLLLLNRLST-----ADITPSQSRADVLGLLLAAGLILVGLLWTRAQPRS 55

Query: 126 ITVNSFSLFQVNPKGVECQMICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQ 185
                     V+ +G +   +   L +++  EL WA   L   T   S++V +DG  LL+
Sbjct: 56  PE-------AVDLEGEQGFELSPDLSEALKEELAWASHLLLTATPAASVLVYWDGQVLLR 108

Query: 186 IGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQA 245
            G+                K + G +    + ++   YL NL LYPGR E  +LP NTQA
Sbjct: 109 RGILGP-----------PVKFEPGEICKRALETQQLVYLVNLKLYPGRIEFDYLPPNTQA 157

Query: 246 VILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLD 283
           VI+QPLG++G+ ++GG + R FT SD+ WI    +KL 
Sbjct: 158 VIVQPLGNRGVLVLGGWSPRCFTRSDERWIEGWADKLR 195


This family of proteins has no known function. Length = 195

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 301
PF11152195 DUF2930: Protein of unknown function (DUF2930); In 100.0
PF13492129 GAF_3: GAF domain; PDB: 3EEA_A 4DMZ_A 4DN0_A 1VHM_ 96.82
PF13185148 GAF_2: GAF domain; PDB: 2QYB_A 3KSG_B 3KSF_C 3KSI_ 96.81
smart00065149 GAF Domain present in phytochromes and cGMP-specif 96.8
PF01590154 GAF: GAF domain; InterPro: IPR003018 This domain i 96.33
PRK11061 748 fused phosphoenolpyruvate-protein phosphotransfera 95.74
TIGR01817 534 nifA Nif-specific regulatory protein. This model r 94.15
TIGR02851180 spore_V_T stage V sporulation protein T. Members o 92.35
PRK15429 686 formate hydrogenlyase transcriptional activator Fh 91.72
COG1956163 GAF domain-containing protein [Signal transduction 89.73
>PF11152 DUF2930: Protein of unknown function (DUF2930); InterPro: IPR021325 This family of proteins has no known function Back     alignment and domain information
Probab=100.00  E-value=6.1e-80  Score=552.01  Aligned_cols=195  Identities=43%  Similarity=0.677  Sum_probs=190.3

Q ss_pred             hhHHHHHHHHHHHHHHhhhcCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccccccCCCCcccee
Q 022217           66 LPIYVGGASLLAVLFNRTVSGIAPVADASSSQSRADLLAISLAVTSILTGLVWLSIRPKSITVNSFSLFQVNPKGVECQM  145 (301)
Q Consensus        66 LP~~vG~l~~~llL~Nr~lsgiApv~~~t~aQ~RaevLgl~Lsa~liL~Gllw~~~~P~~~~~~~~s~~~V~L~G~e~f~  145 (301)
                      ||+++|+++++++++||+++     +++||||+|||+||++||++|||+|++|+++||+++++       |+|+|+|+|+
T Consensus         1 LPl~vG~l~~~lll~N~~~~-----~~~t~sq~RaevLg~~ls~~liL~g~lw~~l~p~~~~~-------v~L~G~~~f~   68 (195)
T PF11152_consen    1 LPLIVGLLGGSLLLLNRLSG-----ADATPSQSRAEVLGLILSAVLILTGLLWQRLKPRSPEA-------VDLPGEQGFE   68 (195)
T ss_pred             CceEehHHHHHHHHhhhccc-----CCCCHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCc-------cCCCCccceE
Confidence            69999999999999999665     89999999999999999999999999999999999999       9999999999


Q ss_pred             ecCCCCHHHHHHHHHHHHHHhhhccccEEEEEECCEEEEEEeeccCCCCCccchhhcccccchhhHHHHHhhccCceecc
Q 022217          146 ICSYLPDSVVSELLWAWESLSAVTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQGSVYLGVMRSKAQRYLA  225 (301)
Q Consensus       146 l~~~Lsd~~k~ELAWas~~LLtnT~a~svlV~~~G~vll~RG~~~~~~~~g~a~~~d~~~~~pG~I~~r~~~~~~~iyL~  225 (301)
                      |++||||++|+||||+||+|||||+|+||+|||||+++||||+++++           ++|+||+||+++|++++++||+
T Consensus        69 l~~~L~~~~k~ELAWas~~LLt~T~a~svlV~~~g~~llrrG~~~~~-----------~~~~~g~i~~~~~~~~~~~yL~  137 (195)
T PF11152_consen   69 LSPDLSDAAKEELAWASYALLTNTNARSVLVYYDGRVLLRRGILGSP-----------SKFKPGPICQRAMESGKLIYLV  137 (195)
T ss_pred             ECCCCCHHHHHHHHHHHHHHHhhCCceEEEEEECCEEEEEeeccCCC-----------CCcchHHHHHHHHhcCCceecc
Confidence            99999999999999999999999999999999999999999999864           3899999999999999999999


Q ss_pred             ccccCCCCCCCCCCCCCCCeeEEEecCCCeEEEEecCCCcccChhhHHHHHHHHHHHH
Q 022217          226 NLLLYPGRSELPFLPSNTQAVILQPLGDKGIAIIGGDTIRGFTTSDQSWIAFIGEKLD  283 (301)
Q Consensus       226 nl~lyPGr~Ef~~LP~ntqsVlvQPig~~Gvlvlgs~s~R~FT~kDe~WI~~iA~KL~  283 (301)
                      |++|||||+||+|||+|||||||||+|++||+|+|||++||||||||+||++|||||+
T Consensus       138 nl~lyPGr~Ef~~lP~ntq~VlvqP~g~~G~lvlgs~~~R~ft~~D~~Wi~~iA~Kl~  195 (195)
T PF11152_consen  138 NLKLYPGRVEFDYLPENTQSVLVQPLGQNGVLVLGSNSPRAFTKSDEAWIAGIADKLR  195 (195)
T ss_pred             ccccCCCchhhhhcCCCCcEEEEEEcCCCeEEEEeeCCccccCHHHHHHHHHHHHhcC
Confidence            9999999999999999999999999999999999999999999999999999999985



>PF13492 GAF_3: GAF domain; PDB: 3EEA_A 4DMZ_A 4DN0_A 1VHM_A Back     alignment and domain information
>PF13185 GAF_2: GAF domain; PDB: 2QYB_A 3KSG_B 3KSF_C 3KSI_A 3KSH_A 3MMH_A 3RFB_B 1F5M_A 3KO6_B 3HCY_A Back     alignment and domain information
>smart00065 GAF Domain present in phytochromes and cGMP-specific phosphodiesterases Back     alignment and domain information
>PF01590 GAF: GAF domain; InterPro: IPR003018 This domain is present in phytochromes and cGMP-specific phosphodiesterases Back     alignment and domain information
>PRK11061 fused phosphoenolpyruvate-protein phosphotransferase PtsP/GAF domain; Provisional Back     alignment and domain information
>TIGR01817 nifA Nif-specific regulatory protein Back     alignment and domain information
>TIGR02851 spore_V_T stage V sporulation protein T Back     alignment and domain information
>PRK15429 formate hydrogenlyase transcriptional activator FhlA; Provisional Back     alignment and domain information
>COG1956 GAF domain-containing protein [Signal transduction mechanisms] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query301
3mmh_A167 FRMSR, methionine-R-sulfoxide reductase; oxidoredu 97.79
3ksh_A160 Putative uncharacterized protein; FRMSR, free-Met- 97.59
3rfb_A171 Putative uncharacterized protein; FRMSR, GAF, oxid 97.59
1vhm_A195 Protein YEBR; structural genomics, unknown functio 97.56
3e0y_A181 Conserved domain protein; APC87688.2, geobacter su 97.41
3p01_A184 Two-component response regulator; PSI-2, midwest c 97.4
2w3g_A153 DOSS, two component sensor histidine kinase DEVS ( 97.36
3trc_A171 Phosphoenolpyruvate-protein phosphotransferase; si 97.31
3ci6_A171 Phosphoenolpyruvate-protein phosphotransferase; PE 97.27
2zmf_A189 CAMP and CAMP-inhibited CGMP 3',5'-cyclic phospho 97.04
3o5y_A165 Sensor protein; GAF domain, histidine, kinase, PSI 96.95
3k2n_A177 Sigma-54-dependent transcriptional regulator; PSI- 96.93
3oov_A169 Methyl-accepting chemotaxis protein, putative; str 96.89
2vjw_A149 GAF-B, GAF family protein; histidine kinase, hypox 96.83
2qyb_A181 Membrane protein, putative; GAF domain, domain of 96.81
1f5m_A180 GAF; CGMP binding, signaling protein; 1.90A {Sacch 96.74
3hcy_A151 Putative two-component sensor histidine kinase PR; 96.66
2e4s_A189 CAMP and CAMP-inhibited CGMP 3',5'-cyclic phosphod 96.49
3dba_A180 CONE CGMP-specific 3',5'-cyclic phosphodiesterase 94.88
3ibj_A 691 CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE 94.81
1mc0_A368 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF 94.72
2lb5_A208 Sensor histidine kinase; PCB, transferase, GAF dom 94.22
2k2n_A172 Sensor protein, SYB-CPH1(GAF); phytochrome, GAF do 94.13
1ykd_A398 Adenylate cyclase; GAF domain, bound cyclic AMP li 93.03
1ykd_A 398 Adenylate cyclase; GAF domain, bound cyclic AMP li 92.47
4glq_A171 Methyl-accepting chemotaxis protein; chromophore, 92.01
1mc0_A 368 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF 88.03
2w1r_A123 Spovt, stage V sporulation protein T; regulatory G 87.6
3cit_A160 Sensor histidine kinase; MEGA: 3.30.450.40, struct 84.57
3ibj_A 691 CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE 81.12
>3mmh_A FRMSR, methionine-R-sulfoxide reductase; oxidoreductase; HET: SME MRD; 1.25A {Neisseria meningitidis} SCOP: d.110.2.0 Back     alignment and structure
Probab=97.79  E-value=0.00033  Score=59.67  Aligned_cols=82  Identities=13%  Similarity=0.211  Sum_probs=69.8

Q ss_pred             ccchh-hHHHHHhhccCceeccccccCCCCCCCCCCCCCCCeeEEEecC--CC--eEEEEecCCCcccChhhHHHHHHHH
Q 022217          205 KLKQG-SVYLGVMRSKAQRYLANLLLYPGRSELPFLPSNTQAVILQPLG--DK--GIAIIGGDTIRGFTTSDQSWIAFIG  279 (301)
Q Consensus       205 ~~~pG-~I~~r~~~~~~~iyL~nl~lyPGr~Ef~~LP~ntqsVlvQPig--~~--Gvlvlgs~s~R~FT~kDe~WI~~iA  279 (301)
                      .+.+| -+|.++.++++++.+.|..-+|+....+   .+++|.+|.||-  ++  |||-+.+..++.||+.|+..++.+|
T Consensus        77 ~i~~geGi~G~v~~~g~~~~v~Dv~~~p~~~~~~---~~~~S~i~vPi~~~g~viGVL~i~s~~~~~F~~~d~~~L~~lA  153 (167)
T 3mmh_A           77 RIPFGRGVCGQAWAKGGTVVVGDVDAHPDHIACS---SLSRSEIVVPLFSDGRCIGVLDADSEHLAQFDETDALYLGELA  153 (167)
T ss_dssp             EEETTSHHHHHHHHHTSCEEESCGGGSTTCCCSS---TTCCEEEEEEEEETTEEEEEEEEEESSTTCCCHHHHHHHHHHH
T ss_pred             EeccCCChHHHHHhCCcEEEECCcccCcchhhcC---ccCCeEEEEEeccCCEEEEEEEEecCCCCCCCHHHHHHHHHHH
Confidence            45665 6899999999999999999999854433   579999999994  32  9999999999999999999999999


Q ss_pred             HHHHHHHhhc
Q 022217          280 EKLDATLAKY  289 (301)
Q Consensus       280 ~KL~~tL~~~  289 (301)
                      +-+..++.-+
T Consensus       154 ~~la~~i~~~  163 (167)
T 3mmh_A          154 KILEKRFEAS  163 (167)
T ss_dssp             HHHHHHHHHH
T ss_pred             HHHHHHHHhh
Confidence            9999887643



>3ksh_A Putative uncharacterized protein; FRMSR, free-Met-R-SO, oxidoreductase; 1.50A {Staphylococcus aureus} SCOP: d.110.2.0 PDB: 3ksf_A 3ksi_A 3ksg_A* Back     alignment and structure
>3rfb_A Putative uncharacterized protein; FRMSR, GAF, oxidoreductase, SME; HET: SME; 2.30A {Streptococcus pneumoniae} Back     alignment and structure
>1vhm_A Protein YEBR; structural genomics, unknown function; HET: MES; 2.10A {Escherichia coli} SCOP: d.110.2.1 Back     alignment and structure
>3e0y_A Conserved domain protein; APC87688.2, geobacter sulfurreducens PCA, structural genomics, PSI-2, midwest center for structural G MCSG; 3.10A {Geobacter sulfurreducens} Back     alignment and structure
>3p01_A Two-component response regulator; PSI-2, midwest center for structural genomics, protein struc initiative, MCSG, signali protein; 2.65A {Nostoc SP} Back     alignment and structure
>2w3g_A DOSS, two component sensor histidine kinase DEVS (GAF family protein); redox sensor, heme, hypoxia, GAF domain, transferase; HET: HEM; 1.40A {Mycobacterium tuberculosis} PDB: 2w3d_A* 2w3f_A* 2w3e_A* 2w3h_A* 2y79_A* 2y8h_A* 2vzw_A* Back     alignment and structure
>3trc_A Phosphoenolpyruvate-protein phosphotransferase; signal transduction; HET: MSE; 1.65A {Coxiella burnetii} Back     alignment and structure
>3ci6_A Phosphoenolpyruvate-protein phosphotransferase; PEP-phosphotransferase, GAF domain, structura genomics, PSI-2, protein structure initiative; HET: MSE P4G; 1.55A {Acinetobacter SP} Back     alignment and structure
>2zmf_A CAMP and CAMP-inhibited CGMP 3',5'-cyclic phospho 10A; GAF domain, phosphodiesterase, CGMP-binding, HYD nucleotide-binding, structural genomics; HET: MSE CMP; 2.10A {Homo sapiens} Back     alignment and structure
>3o5y_A Sensor protein; GAF domain, histidine, kinase, PSI, MCSG, structural genomics, midwest center for structural genomics, protein S initiative; 2.45A {Bacillus halodurans} Back     alignment and structure
>3k2n_A Sigma-54-dependent transcriptional regulator; PSI-2, protein structure initiative, structural genomics; 2.50A {Chlorobium tepidum tls} Back     alignment and structure
>3oov_A Methyl-accepting chemotaxis protein, putative; structural genomics, PSI-2, protein structure initiative; 2.20A {Geobacter sulfurreducens} Back     alignment and structure
>2vjw_A GAF-B, GAF family protein; histidine kinase, hypoxia sensing, hydrolase; HET: MSE; 2.0A {Mycobacterium smegmatis} PDB: 2vks_A Back     alignment and structure
>2qyb_A Membrane protein, putative; GAF domain, domain of putative membrane protein, PSI-2, MCSG structural genomics; 2.40A {Geobacter sulfurreducens pca} Back     alignment and structure
>1f5m_A GAF; CGMP binding, signaling protein; 1.90A {Saccharomyces cerevisiae} SCOP: d.110.2.1 PDB: 3ko6_A* Back     alignment and structure
>3hcy_A Putative two-component sensor histidine kinase PR; two-component sensor histidine kinase protein, structural GE PSI, MCSG; 2.80A {Sinorhizobium meliloti} Back     alignment and structure
>3dba_A CONE CGMP-specific 3',5'-cyclic phosphodiesterase alpha'; 3', GAF domain, cyclic nucleotide phosphodiesterase hydrolase, lipoprotein, membrane; HET: 35G; 2.57A {Gallus gallus} Back     alignment and structure
>3ibj_A CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE2A, GAF-domains, allosteric regulation hydrolase, membrane; 3.02A {Homo sapiens} Back     alignment and structure
>1mc0_A 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF domain, 3',5' guanosine monophosphate, hydrolase; HET: PCG; 2.86A {Mus musculus} SCOP: d.110.2.1 d.110.2.1 Back     alignment and structure
>2lb5_A Sensor histidine kinase; PCB, transferase, GAF domain, phosphoprotein; HET: CYC; NMR {Synechococcus SP} PDB: 2lb9_A* Back     alignment and structure
>2k2n_A Sensor protein, SYB-CPH1(GAF); phytochrome, GAF domain, phycocyanobilin, PCB, bacteriophytochrome, cyanobacterial phytochrome, kinase; HET: CYC; NMR {Synechococcus SP} SCOP: d.110.2.1 PDB: 2kli_A* 2koi_A* Back     alignment and structure
>1ykd_A Adenylate cyclase; GAF domain, bound cyclic AMP ligand, lyase; HET: CMP; 1.90A {Anabaena SP} Back     alignment and structure
>1ykd_A Adenylate cyclase; GAF domain, bound cyclic AMP ligand, lyase; HET: CMP; 1.90A {Anabaena SP} Back     alignment and structure
>4glq_A Methyl-accepting chemotaxis protein; chromophore, phytochrome, cyanobacteriochrome, phycoviolobil bilin, BILI-protein; HET: PVN; 1.77A {Thermosynechococcus elongatus} PDB: 4fof_A* Back     alignment and structure
>1mc0_A 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF domain, 3',5' guanosine monophosphate, hydrolase; HET: PCG; 2.86A {Mus musculus} SCOP: d.110.2.1 d.110.2.1 Back     alignment and structure
>2w1r_A Spovt, stage V sporulation protein T; regulatory GAF domain of spovt, transcription, transcription regulation, repressor, activator; 1.50A {Bacillus subtilis} Back     alignment and structure
>3cit_A Sensor histidine kinase; MEGA: 3.30.450.40, structural genomics, sensor histidine KIN pseudomonas syringae, PSI-2; HET: MSE; 1.90A {Pseudomonas syringae PV} Back     alignment and structure
>3ibj_A CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE2A, GAF-domains, allosteric regulation hydrolase, membrane; 3.02A {Homo sapiens} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query301
d1vhma_159 Hypothetical protein YebR {Escherichia coli [TaxId 97.93
d1f5ma_176 Hypothetical protein ykl069wp {Baker's yeast (Sacc 97.58
d1mc0a1187 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF 95.52
d1mc0a2154 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF 92.22
d2k2na1170 Sensor protein CYB2465 {Synechococcus sp. [TaxId: 82.37
>d1vhma_ d.110.2.1 (A:) Hypothetical protein YebR {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Profilin-like
superfamily: GAF domain-like
family: GAF domain
domain: Hypothetical protein YebR
species: Escherichia coli [TaxId: 562]
Probab=97.93  E-value=0.00011  Score=59.32  Aligned_cols=118  Identities=16%  Similarity=0.198  Sum_probs=85.4

Q ss_pred             HHHHHHHHhh---hccccEEEEEECCEEEEEEeeccCCCCCccchhhcccccchh-hHHHHHhhccCceeccccccCCCC
Q 022217          158 LLWAWESLSA---VTCCRSLVVVYDGIYLLQIGMAAESGNSGEAVIVDASKLKQG-SVYLGVMRSKAQRYLANLLLYPGR  233 (301)
Q Consensus       158 LAWas~~LLt---nT~a~svlV~~~G~vll~RG~~~~~~~~g~a~~~d~~~~~pG-~I~~r~~~~~~~iyL~nl~lyPGr  233 (301)
                      |+-.+..|.+   ++...++.+..++...+  +.......        ...+..| .+|.++.++++++.+.|..-.|..
T Consensus        27 L~~i~~~l~~~~~~~~~~~~yl~d~~~~~l--~~~~~~~~--------~~~i~~g~g~~g~v~~~~~~~~v~d~~~d~~~   96 (159)
T d1vhma_          27 LANTSALLYERLTDINWAGFYLLEDDTLVL--GPFQGKIA--------CVRIPVGRGVCGTAVARNQVQRIEDVHVFDGH   96 (159)
T ss_dssp             HHHHHHHHHHHSSSCSEEEEEEEETTEEEE--EEEEESCC--------CSEEETTSHHHHHHHHHTSCEEESCTTTCTTC
T ss_pred             HHHHHHHHHHhhccCceEEEEEEECCEeee--eeecCCcc--------eeEEecCCChHHHHHHcCCcEEecChHHhhhh
Confidence            5666666643   44445565656665443  22222111        0134444 478999999999999999877764


Q ss_pred             CCCCCCCCCCCeeEEEecC--CC--eEEEEecCCCcccChhhHHHHHHHHHHHHHHHhh
Q 022217          234 SELPFLPSNTQAVILQPLG--DK--GIAIIGGDTIRGFTTSDQSWIAFIGEKLDATLAK  288 (301)
Q Consensus       234 ~Ef~~LP~ntqsVlvQPig--~~--Gvlvlgs~s~R~FT~kDe~WI~~iA~KL~~tL~~  288 (301)
                      .   ....+.+|++|.||-  ++  |+|.+++..++.||+.|...++.+|+.+..+++.
T Consensus        97 ~---~~~~~~~S~l~vPl~~~~~viGvl~v~s~~~~~F~~~d~~ll~~la~qia~ai~~  152 (159)
T d1vhma_          97 I---ACDAASNSEIVLPLVVKNQIIGVLDIDSTVFGRFTDEDEQGLRQLVAQLEKVLAT  152 (159)
T ss_dssp             C---CSCCCCSEEEEEEEEETTEEEEEEEEEESSTTCCCHHHHHHHHHHHHHHHHHHHT
T ss_pred             h---hcccccceeEEeCEEECCEEEEEEEecCCCCCCCCHHHHHHHHHHHHHHHHHHHc
Confidence            4   346789999999984  32  9999999999999999999999999999999875



>d1f5ma_ d.110.2.1 (A:) Hypothetical protein ykl069wp {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1mc0a1 d.110.2.1 (A:215-401) 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF A and GAF B domains {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1mc0a2 d.110.2.1 (A:402-555) 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF A and GAF B domains {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2k2na1 d.110.2.1 (A:31-200) Sensor protein CYB2465 {Synechococcus sp. [TaxId: 1131]} Back     information, alignment and structure