Citrus Sinensis ID: 023161


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280------
MSILPYNSLMQLKIPSHSCAKTTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK
cccccccccccccccccccccccccccccEEcccccHHHHHHccccEEEEEcccccccccccHHHHHHHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccEEcEEEcccccHHHHHHHHHHHHHHHHcccccEEEEEEccEEEEEEEEcccccccHHHHHHHHHHHHcccccccccccccccccccccccccccccccEEEEEEccccccccccccccccEEEEEEEcccccccHHHHHHHHHHHHHHcccccccccc
ccccEccccccccccccccccccccEEEEEEEccccccccccccEEEEEEEcccccccHcHccHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccEEEEEcccccHHHHHHHHHHHHHHHHccccEEEEEEEccEEEEEEccccccccccHHHHHHHHHHcccEEEEEccccEEccccccccccccccccccEEEEEEccccccccccccccccEEEEEEccccccccHHHHHHHHHHHHHHcccEEEEEcc
msilpynslmqlkipshscakttrKSFSICARLDNSQQEEQQLNLSVLRftfgipgfdesylprwigygFGSLIVLNHfafsnsvtsAQLRSEVLGLSLAAFSVTLPYlgkflkgaspvsqkslpesgEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELcvrgywqtpdgaskTQLLEWFERQIENIglsdlkdslyfpqsadagqwemlpkgtcsvfvqpviqapnpsavevEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK
msilpynslmqlkipshscaKTTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKavsnkfrgksiwidlk
MSILPYNSLMQLKIPSHSCAKTTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK
**************************FSICAR*********QLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKG***************IFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWID**
****PYNSLMQLK*********************************VLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFL*****************IFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQ*********LLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDL*
MSILPYNSLMQLKIPSHSCAKTTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK
*SILPYNSLMQLKIPSHSCA*TTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooHHHHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHooooooooooooooooHHHHHHHHHHHHHHHHHiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSILPYNSLMQLKIPSHSCAKTTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query286
225455453311 PREDICTED: uncharacterized protein LOC10 0.895 0.823 0.726 1e-107
297741104313 unnamed protein product [Vitis vinifera] 0.891 0.814 0.725 1e-106
224136914265 predicted protein [Populus trichocarpa] 0.905 0.977 0.698 1e-99
255539547308 conserved hypothetical protein [Ricinus 0.947 0.879 0.647 4e-98
449446061288 PREDICTED: uncharacterized protein LOC10 0.916 0.909 0.642 1e-94
15242277275 uncharacterized protein [Arabidopsis tha 0.905 0.941 0.609 5e-90
297796017272 hypothetical protein ARALYDRAFT_495283 [ 0.905 0.952 0.609 8e-90
147816662349 hypothetical protein VITISV_012455 [Viti 0.895 0.733 0.588 9e-90
356559871287 PREDICTED: uncharacterized protein LOC10 0.828 0.825 0.640 2e-84
242040471293 hypothetical protein SORBIDRAFT_01g03118 0.951 0.928 0.537 6e-78
>gi|225455453|ref|XP_002274647.1| PREDICTED: uncharacterized protein LOC100252183 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  392 bits (1008), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 191/263 (72%), Positives = 218/263 (82%), Gaps = 7/263 (2%)

Query: 18  SCAKTTRKSFSICARLDNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLN 77
           S   TTRK  SI A       ++QQLNLSVLRFT GIPGFDESYLPRWIGYGFGS I+LN
Sbjct: 54  STINTTRKFQSISA-----SNQQQQLNLSVLRFTLGIPGFDESYLPRWIGYGFGSFILLN 108

Query: 78  HFAFS--NSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMS 135
           HF  S  N++T+AQLR+E LGL LAAFSV LPYLGKFLKGA+PV Q +LPE  EQIFVM+
Sbjct: 109 HFVGSDLNTITAAQLRTEALGLCLAAFSVVLPYLGKFLKGAAPVDQTTLPEGIEQIFVMT 168

Query: 136 QNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQI 195
           QNISD LKE+LAWATY+LLRNTN+I+VLISIRG LCVRGYW TPD  SK ++L+W E++I
Sbjct: 169 QNISDILKEDLAWATYILLRNTNTIAVLISIRGALCVRGYWNTPDDVSKARVLDWVEKEI 228

Query: 196 ENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLAS 255
           E IGLSDLKD+LYFPQSAD+G WEMLPKGTCS+ VQPV Q P+    E+EKI+GFVLLAS
Sbjct: 229 EKIGLSDLKDTLYFPQSADSGLWEMLPKGTCSLLVQPVSQIPSQGTDEMEKIDGFVLLAS 288

Query: 256 SMTYAYSHKDRAWIKAVSNKFRG 278
           SM YAY+ KDRAWI AV+NKFRG
Sbjct: 289 SMNYAYTDKDRAWIGAVANKFRG 311




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|297741104|emb|CBI31835.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224136914|ref|XP_002322447.1| predicted protein [Populus trichocarpa] gi|222869443|gb|EEF06574.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255539547|ref|XP_002510838.1| conserved hypothetical protein [Ricinus communis] gi|223549953|gb|EEF51440.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|449446061|ref|XP_004140790.1| PREDICTED: uncharacterized protein LOC101219803 [Cucumis sativus] gi|449530412|ref|XP_004172189.1| PREDICTED: uncharacterized LOC101219803 [Cucumis sativus] Back     alignment and taxonomy information
>gi|15242277|ref|NP_200024.1| uncharacterized protein [Arabidopsis thaliana] gi|79330641|ref|NP_001032060.1| uncharacterized protein [Arabidopsis thaliana] gi|10177746|dbj|BAB11059.1| unnamed protein product [Arabidopsis thaliana] gi|117959001|gb|ABK59697.1| At5g52110 [Arabidopsis thaliana] gi|332008789|gb|AED96172.1| uncharacterized protein [Arabidopsis thaliana] gi|332008790|gb|AED96173.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297796017|ref|XP_002865893.1| hypothetical protein ARALYDRAFT_495283 [Arabidopsis lyrata subsp. lyrata] gi|297311728|gb|EFH42152.1| hypothetical protein ARALYDRAFT_495283 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|147816662|emb|CAN68387.1| hypothetical protein VITISV_012455 [Vitis vinifera] Back     alignment and taxonomy information
>gi|356559871|ref|XP_003548220.1| PREDICTED: uncharacterized protein LOC100817953 [Glycine max] Back     alignment and taxonomy information
>gi|242040471|ref|XP_002467630.1| hypothetical protein SORBIDRAFT_01g031180 [Sorghum bicolor] gi|241921484|gb|EER94628.1| hypothetical protein SORBIDRAFT_01g031180 [Sorghum bicolor] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query286
TAIR|locus:2173048275 HCF208 "AT5G52110" [Arabidopsi 0.905 0.941 0.609 1.7e-84
TAIR|locus:2173048 HCF208 "AT5G52110" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 846 (302.9 bits), Expect = 1.7e-84, P = 1.7e-84
 Identities = 164/269 (60%), Positives = 212/269 (78%)

Query:    15 PSHSCAKTTRKSFSICARLDN----SQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGF 70
             P  +     ++S  I AR +N    S+  +QQLNLSVLRFTFGIPGFDESYLPRWIGYGF
Sbjct:    12 PKFALQPRAQRSTRIFARTENDSPQSKTSDQQLNLSVLRFTFGIPGFDESYLPRWIGYGF 71

Query:    71 GSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQ 130
             GSL++LNHF+ S  ++ +Q+RSE LGLSLAAFS+ LPY+GKFLKG S V Q+SLPE GEQ
Sbjct:    72 GSLLLLNHFSASAPISESQMRSEALGLSLAAFSIALPYIGKFLKG-SVVEQRSLPEEGEQ 130

Query:   131 IFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEW 190
             +FV+S NI D+LKE+LAWATYVLLRNT++I+VLIS++GELCVRGYW  PD  SK QL +W
Sbjct:   131 VFVISSNIGDSLKEDLAWATYVLLRNTSTIAVLISVQGELCVRGYWNCPDQMSKAQLHDW 190

Query:   191 FERQIENIGLSDLKDSLYFPQSADAG-QWEMLPKGTCSVFVQPVIQAPNPSAVEVEKIEG 249
             F+++++ IGL+D+K++LYFPQ A +    ++LP GT S+FVQP++Q  N    E +K+ G
Sbjct:   191 FKKKVDEIGLADVKETLYFPQYAGSALSLDILPDGTRSLFVQPLVQNTN----EPQKVNG 246

Query:   250 FVLLASSMTYAYSHKDRAWIKAVSNKFRG 278
             F+L+AS+  YAYS KDRAWI A++ KFRG
Sbjct:   247 FLLVASTAGYAYSDKDRAWIGAMAEKFRG 275


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.133   0.395    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      286       286   0.00086  115 3  11 22  0.37    34
                                                     33  0.42    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  612 (65 KB)
  Total size of DFA:  213 KB (2117 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  22.33u 0.21s 22.54t   Elapsed:  00:00:01
  Total cpu time:  22.33u 0.21s 22.54t   Elapsed:  00:00:01
  Start:  Sat May 11 13:39:25 2013   End:  Sat May 11 13:39:26 2013


GO:0003674 "molecular_function" evidence=ND
GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0010190 "cytochrome b6f complex assembly" evidence=IMP
GO:0010207 "photosystem II assembly" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query286
pfam11152195 pfam11152, DUF2930, Protein of unknown function (D 6e-56
>gnl|CDD|221004 pfam11152, DUF2930, Protein of unknown function (DUF2930) Back     alignment and domain information
 Score =  178 bits (454), Expect = 6e-56
 Identities = 66/217 (30%), Positives = 105/217 (48%), Gaps = 23/217 (10%)

Query: 62  LPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQ 121
           LP   G   G L++LN    +  +T +Q R++VLGL LAA  + +  L    +  SP   
Sbjct: 1   LPLIAGLLGGLLLLLNRL-STADITPSQSRADVLGLLLAAGLILVGLLWTRAQPRSP-EA 58

Query: 122 KSLPESGEQIFVMSQNISDALKENLAWATYVLLRNTNSISVLISIRGELCVR-GYWQTPD 180
             L   GEQ F +S ++S+ALKE LAWA+++LL  T + SVL+   G++ +R G    P 
Sbjct: 59  VDLE--GEQGFELSPDLSEALKEELAWASHLLLTATPAASVLVYWDGQVLLRRGILGPPV 116

Query: 181 GASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQPVIQAPNPS 240
              K +  E  +R +E   L  L +   +P   +   ++ LP  T +V VQP+       
Sbjct: 117 ---KFEPGEICKRALETQQLVYLVNLKLYPGRIE---FDYLPPNTQAVIVQPLGN----- 165

Query: 241 AVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFR 277
                   G ++L       ++  D  WI+  ++K R
Sbjct: 166 -------RGVLVLGGWSPRCFTRSDERWIEGWADKLR 195


This family of proteins has no known function. Length = 195

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 286
PF11152195 DUF2930: Protein of unknown function (DUF2930); In 100.0
PF13185148 GAF_2: GAF domain; PDB: 2QYB_A 3KSG_B 3KSF_C 3KSI_ 95.25
PF13492129 GAF_3: GAF domain; PDB: 3EEA_A 4DMZ_A 4DN0_A 1VHM_ 94.96
PRK11061 748 fused phosphoenolpyruvate-protein phosphotransfera 92.61
smart00065149 GAF Domain present in phytochromes and cGMP-specif 91.99
PF01590154 GAF: GAF domain; InterPro: IPR003018 This domain i 90.33
TIGR02851180 spore_V_T stage V sporulation protein T. Members o 89.43
PRK15429 686 formate hydrogenlyase transcriptional activator Fh 85.2
TIGR01817 534 nifA Nif-specific regulatory protein. This model r 82.6
>PF11152 DUF2930: Protein of unknown function (DUF2930); InterPro: IPR021325 This family of proteins has no known function Back     alignment and domain information
Probab=100.00  E-value=2e-73  Score=506.90  Aligned_cols=194  Identities=37%  Similarity=0.544  Sum_probs=185.2

Q ss_pred             hhHHHHHHHHHHHHhhhhccCCCCChhhHHHHHHHHHHHHhhhhhhhhhhhhcccCCCCCCCCCCCCeeceEeccCchHH
Q 023161           62 LPRWIGYGFGSLIVLNHFAFSNSVTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDA  141 (286)
Q Consensus        62 LP~~vG~l~g~Lll~N~~~s~a~~t~aQ~RsevLg~~Laa~~v~L~l~G~~w~~~~P~~~~~v~l~geq~F~l~~~Lsd~  141 (286)
                      ||+++|+++++++++||++++ .+||||+||||||++||+++   +++|++|++++|++++.+++.|+|+|+|++||||+
T Consensus         1 LPl~vG~l~~~lll~N~~~~~-~~t~sq~RaevLg~~ls~~l---iL~g~lw~~l~p~~~~~v~L~G~~~f~l~~~L~~~   76 (195)
T PF11152_consen    1 LPLIVGLLGGSLLLLNRLSGA-DATPSQSRAEVLGLILSAVL---ILTGLLWQRLKPRSPEAVDLPGEQGFELSPDLSDA   76 (195)
T ss_pred             CceEehHHHHHHHHhhhcccC-CCCHHHHHHHHHHHHHHHHH---HHHHHHHhhccCCCCCccCCCCccceEECCCCCHH
Confidence            799999999999999997776 68999999999999999555   89999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHhhhccccEEEEEECCEEE-EeeeecCCCCCChHHHHHHHHHhhhhcccccCCCcccccCCcccccccc
Q 023161          142 LKENLAWATYVLLRNTNSISVLISIRGELC-VRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEM  220 (286)
Q Consensus       142 ~k~ELAWaS~~LLtnT~a~svlV~~~g~ll-~RG~~~~p~~~~~g~I~~~~~~~~~~~~Lv~L~~t~lyPgr~E~~~~~~  220 (286)
                      +|+|||||||+|||||+|+||+|||||++| +||++++++++++|+||++++++++.+||+|+   .|||||+|   |+|
T Consensus        77 ~k~ELAWas~~LLt~T~a~svlV~~~g~~llrrG~~~~~~~~~~g~i~~~~~~~~~~~yL~nl---~lyPGr~E---f~~  150 (195)
T PF11152_consen   77 AKEELAWASYALLTNTNARSVLVYYDGRVLLRRGILGSPSKFKPGPICQRAMESGKLIYLVNL---KLYPGRVE---FDY  150 (195)
T ss_pred             HHHHHHHHHHHHHhhCCceEEEEEECCEEEEEeeccCCCCCcchHHHHHHHHhcCCceecccc---ccCCCchh---hhh
Confidence            999999999999999999999999999966 58999888999999999999999888888877   89999999   899


Q ss_pred             CCCCCceeEEEeccCCCCCCcccccccceEEEEeeCCccccchhhHHHHHHHHhhhc
Q 023161          221 LPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFR  277 (286)
Q Consensus       221 LP~ntqsVlvQPig~~~~~~~~~~~~~~Gvlvlgs~s~RafT~kDe~WI~~iA~KL~  277 (286)
                      ||+|||||||||+|+            +||||+|||++||||+|||+||++|||||+
T Consensus       151 lP~ntq~VlvqP~g~------------~G~lvlgs~~~R~ft~~D~~Wi~~iA~Kl~  195 (195)
T PF11152_consen  151 LPENTQSVLVQPLGQ------------NGVLVLGSNSPRAFTKSDEAWIAGIADKLR  195 (195)
T ss_pred             cCCCCcEEEEEEcCC------------CeEEEEeeCCccccCHHHHHHHHHHHHhcC
Confidence            999999999999999            599999999999999999999999999996



>PF13185 GAF_2: GAF domain; PDB: 2QYB_A 3KSG_B 3KSF_C 3KSI_A 3KSH_A 3MMH_A 3RFB_B 1F5M_A 3KO6_B 3HCY_A Back     alignment and domain information
>PF13492 GAF_3: GAF domain; PDB: 3EEA_A 4DMZ_A 4DN0_A 1VHM_A Back     alignment and domain information
>PRK11061 fused phosphoenolpyruvate-protein phosphotransferase PtsP/GAF domain; Provisional Back     alignment and domain information
>smart00065 GAF Domain present in phytochromes and cGMP-specific phosphodiesterases Back     alignment and domain information
>PF01590 GAF: GAF domain; InterPro: IPR003018 This domain is present in phytochromes and cGMP-specific phosphodiesterases Back     alignment and domain information
>TIGR02851 spore_V_T stage V sporulation protein T Back     alignment and domain information
>PRK15429 formate hydrogenlyase transcriptional activator FhlA; Provisional Back     alignment and domain information
>TIGR01817 nifA Nif-specific regulatory protein Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query286
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 45.2 bits (106), Expect = 2e-05
 Identities = 40/224 (17%), Positives = 77/224 (34%), Gaps = 62/224 (27%)

Query: 28  SICARLDNSQQEEQQLNL--SVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHF------ 79
           +  +R D+S   + +++   + LR         + Y           L+VL +       
Sbjct: 212 NWTSRSDHSSNIKLRIHSIQAELRRLL----KSKPY-EN-------CLLVLLNVQNAKAW 259

Query: 80  -AFSNS----VTSAQLRSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIF-- 132
            AF+ S    +T+   R + +   L+A + T   L       +P       E  + +   
Sbjct: 260 NAFNLSCKILLTT---RFKQVTDFLSAATTTHISLDHHSMTLTP------DEV-KSLLLK 309

Query: 133 VMSQNISDALKENLAWATYVLLRNTNSISVLI--SIRGELCVRGYWQTPDGASKTQLLEW 190
            +     D L         VL  N   +S +I  SIR  L     W+  +    T ++E 
Sbjct: 310 YLDCRPQD-LPRE------VLTTNPRRLS-IIAESIRDGLATWDNWKHVNCDKLTTIIES 361

Query: 191 FERQIENIGLSDLKDSLY-----FPQSADAGQWEMLPKGTCSVF 229
               +  +  ++ +  ++     FP SA       +P    S+ 
Sbjct: 362 S---LNVLEPAEYRK-MFDRLSVFPPSAH------IPTILLSLI 395


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query286
3rfb_A171 Putative uncharacterized protein; FRMSR, GAF, oxid 97.04
3mmh_A167 FRMSR, methionine-R-sulfoxide reductase; oxidoredu 97.01
3p01_A184 Two-component response regulator; PSI-2, midwest c 96.75
3ksh_A160 Putative uncharacterized protein; FRMSR, free-Met- 96.67
3e0y_A181 Conserved domain protein; APC87688.2, geobacter su 96.53
1f5m_A180 GAF; CGMP binding, signaling protein; 1.90A {Sacch 96.36
1vhm_A195 Protein YEBR; structural genomics, unknown functio 96.31
3trc_A171 Phosphoenolpyruvate-protein phosphotransferase; si 96.31
3o5y_A165 Sensor protein; GAF domain, histidine, kinase, PSI 96.04
3ci6_A171 Phosphoenolpyruvate-protein phosphotransferase; PE 96.02
3oov_A169 Methyl-accepting chemotaxis protein, putative; str 95.61
2w3g_A153 DOSS, two component sensor histidine kinase DEVS ( 95.4
2e4s_A189 CAMP and CAMP-inhibited CGMP 3',5'-cyclic phosphod 95.35
3hcy_A151 Putative two-component sensor histidine kinase PR; 95.3
2zmf_A189 CAMP and CAMP-inhibited CGMP 3',5'-cyclic phospho 95.2
3dba_A180 CONE CGMP-specific 3',5'-cyclic phosphodiesterase 95.03
2vjw_A149 GAF-B, GAF family protein; histidine kinase, hypox 94.58
2qyb_A181 Membrane protein, putative; GAF domain, domain of 94.41
3k2n_A177 Sigma-54-dependent transcriptional regulator; PSI- 94.36
2lb5_A208 Sensor histidine kinase; PCB, transferase, GAF dom 92.64
2k2n_A172 Sensor protein, SYB-CPH1(GAF); phytochrome, GAF do 92.06
3ibj_A 691 CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE 91.79
1ykd_A 398 Adenylate cyclase; GAF domain, bound cyclic AMP li 91.31
1mc0_A368 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF 90.38
4glq_A171 Methyl-accepting chemotaxis protein; chromophore, 90.28
1ykd_A398 Adenylate cyclase; GAF domain, bound cyclic AMP li 85.93
>3rfb_A Putative uncharacterized protein; FRMSR, GAF, oxidoreductase, SME; HET: SME; 2.30A {Streptococcus pneumoniae} Back     alignment and structure
Probab=97.04  E-value=0.0095  Score=51.14  Aligned_cols=110  Identities=16%  Similarity=0.156  Sum_probs=79.6

Q ss_pred             hccccEEEEEECCEEEEeeeecCCCC--CCh-HHHHHHHHHhhhhcccccCCCcccccCCccccccccCCCCCceeEEEe
Q 023161          156 NTNSISVLISIRGELCVRGYWQTPDG--ASK-TQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWEMLPKGTCSVFVQP  232 (286)
Q Consensus       156 nT~a~svlV~~~g~ll~RG~~~~p~~--~~~-g~I~~~~~~~~~~~~Lv~L~~t~lyPgr~E~~~~~~LP~ntqsVlvQP  232 (286)
                      ..+.+++-++.++++.+..+-+.++.  +.. .-||-++.++.+..-.-|.   .-+|+...   .+   ..++|.||.|
T Consensus        50 ~~~~~gfYL~~~~~L~l~a~~G~~a~~ri~~GeGv~G~va~tg~~i~V~Dv---~~~p~~i~---~~---~~~~Sei~VP  120 (171)
T 3rfb_A           50 NTVFAGFYLFDGKELVLGPFQGGVSCIRIALGKGVCGEAAHFQETVIVGDV---TTYLNYIS---CD---SLAKSEIVVP  120 (171)
T ss_dssp             TEEEEEEEEECSSEEEEEEEESSSCCCEEETTSHHHHHHHHTTSCEEESCT---TSCSSCCC---SC---TTCCEEEEEE
T ss_pred             CCCEEEEEEEECCEEEEEeccCCccceEeeCCcCHHHHHHhhCCEEEECCc---ccCccccc---cC---cccCceEEEE
Confidence            35566766667777554333333332  222 3477888888888877666   55676543   12   4689999999


Q ss_pred             ccCCCCCCcccccccceEEEEeeCCccccchhhHHHHHHHHhhhccccce
Q 023161          233 VIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIW  282 (286)
Q Consensus       233 ig~~~~~~~~~~~~~~Gvlvlgs~s~RafT~kDe~WI~~iA~KL~~~~~~  282 (286)
                      |-.+        .+.=|||-+.+..++.|++.|+.+++.+|+-+..+..|
T Consensus       121 I~~~--------g~viGVL~i~s~~~~~F~e~D~~~L~~lA~~la~~~~~  162 (171)
T 3rfb_A          121 MMKN--------GQLLGVLDLDSSEIEDYDAMDRDYLEQFVAILLEKTTW  162 (171)
T ss_dssp             EEET--------TEEEEEEEEEESSTTCCCHHHHHHHHHHHHHHHHSCBC
T ss_pred             EEEC--------CEEEEEEEEecCCCCCCCHHHHHHHHHHHHHHHHHhcc
Confidence            9652        24479999999999999999999999999999999888



>3mmh_A FRMSR, methionine-R-sulfoxide reductase; oxidoreductase; HET: SME MRD; 1.25A {Neisseria meningitidis} SCOP: d.110.2.0 Back     alignment and structure
>3p01_A Two-component response regulator; PSI-2, midwest center for structural genomics, protein struc initiative, MCSG, signali protein; 2.65A {Nostoc SP} Back     alignment and structure
>3ksh_A Putative uncharacterized protein; FRMSR, free-Met-R-SO, oxidoreductase; 1.50A {Staphylococcus aureus} SCOP: d.110.2.0 PDB: 3ksf_A 3ksi_A 3ksg_A* Back     alignment and structure
>3e0y_A Conserved domain protein; APC87688.2, geobacter sulfurreducens PCA, structural genomics, PSI-2, midwest center for structural G MCSG; 3.10A {Geobacter sulfurreducens} Back     alignment and structure
>1f5m_A GAF; CGMP binding, signaling protein; 1.90A {Saccharomyces cerevisiae} SCOP: d.110.2.1 PDB: 3ko6_A* Back     alignment and structure
>1vhm_A Protein YEBR; structural genomics, unknown function; HET: MES; 2.10A {Escherichia coli} SCOP: d.110.2.1 Back     alignment and structure
>3trc_A Phosphoenolpyruvate-protein phosphotransferase; signal transduction; HET: MSE; 1.65A {Coxiella burnetii} Back     alignment and structure
>3o5y_A Sensor protein; GAF domain, histidine, kinase, PSI, MCSG, structural genomics, midwest center for structural genomics, protein S initiative; 2.45A {Bacillus halodurans} Back     alignment and structure
>3ci6_A Phosphoenolpyruvate-protein phosphotransferase; PEP-phosphotransferase, GAF domain, structura genomics, PSI-2, protein structure initiative; HET: MSE P4G; 1.55A {Acinetobacter SP} Back     alignment and structure
>3oov_A Methyl-accepting chemotaxis protein, putative; structural genomics, PSI-2, protein structure initiative; 2.20A {Geobacter sulfurreducens} Back     alignment and structure
>2w3g_A DOSS, two component sensor histidine kinase DEVS (GAF family protein); redox sensor, heme, hypoxia, GAF domain, transferase; HET: HEM; 1.40A {Mycobacterium tuberculosis} PDB: 2w3d_A* 2w3f_A* 2w3e_A* 2w3h_A* 2y79_A* 2y8h_A* 2vzw_A* Back     alignment and structure
>3hcy_A Putative two-component sensor histidine kinase PR; two-component sensor histidine kinase protein, structural GE PSI, MCSG; 2.80A {Sinorhizobium meliloti} Back     alignment and structure
>2zmf_A CAMP and CAMP-inhibited CGMP 3',5'-cyclic phospho 10A; GAF domain, phosphodiesterase, CGMP-binding, HYD nucleotide-binding, structural genomics; HET: MSE CMP; 2.10A {Homo sapiens} Back     alignment and structure
>3dba_A CONE CGMP-specific 3',5'-cyclic phosphodiesterase alpha'; 3', GAF domain, cyclic nucleotide phosphodiesterase hydrolase, lipoprotein, membrane; HET: 35G; 2.57A {Gallus gallus} Back     alignment and structure
>2vjw_A GAF-B, GAF family protein; histidine kinase, hypoxia sensing, hydrolase; HET: MSE; 2.0A {Mycobacterium smegmatis} PDB: 2vks_A Back     alignment and structure
>2qyb_A Membrane protein, putative; GAF domain, domain of putative membrane protein, PSI-2, MCSG structural genomics; 2.40A {Geobacter sulfurreducens pca} Back     alignment and structure
>3k2n_A Sigma-54-dependent transcriptional regulator; PSI-2, protein structure initiative, structural genomics; 2.50A {Chlorobium tepidum tls} Back     alignment and structure
>2lb5_A Sensor histidine kinase; PCB, transferase, GAF domain, phosphoprotein; HET: CYC; NMR {Synechococcus SP} PDB: 2lb9_A* Back     alignment and structure
>2k2n_A Sensor protein, SYB-CPH1(GAF); phytochrome, GAF domain, phycocyanobilin, PCB, bacteriophytochrome, cyanobacterial phytochrome, kinase; HET: CYC; NMR {Synechococcus SP} SCOP: d.110.2.1 PDB: 2kli_A* 2koi_A* Back     alignment and structure
>3ibj_A CGMP-dependent 3',5'-cyclic phosphodiesterase; PDE2A, GAF-domains, allosteric regulation hydrolase, membrane; 3.02A {Homo sapiens} Back     alignment and structure
>1ykd_A Adenylate cyclase; GAF domain, bound cyclic AMP ligand, lyase; HET: CMP; 1.90A {Anabaena SP} Back     alignment and structure
>1mc0_A 3',5'-cyclic nucleotide phosphodiesterase 2A; GAF domain, 3',5' guanosine monophosphate, hydrolase; HET: PCG; 2.86A {Mus musculus} SCOP: d.110.2.1 d.110.2.1 Back     alignment and structure
>4glq_A Methyl-accepting chemotaxis protein; chromophore, phytochrome, cyanobacteriochrome, phycoviolobil bilin, BILI-protein; HET: PVN; 1.77A {Thermosynechococcus elongatus} PDB: 4fof_A* Back     alignment and structure
>1ykd_A Adenylate cyclase; GAF domain, bound cyclic AMP ligand, lyase; HET: CMP; 1.90A {Anabaena SP} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query286
d1vhma_159 Hypothetical protein YebR {Escherichia coli [TaxId 97.57
d1f5ma_176 Hypothetical protein ykl069wp {Baker's yeast (Sacc 96.51
d1mc0a2154 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF 89.82
d1mc0a1187 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF 89.8
>d1vhma_ d.110.2.1 (A:) Hypothetical protein YebR {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: Profilin-like
superfamily: GAF domain-like
family: GAF domain
domain: Hypothetical protein YebR
species: Escherichia coli [TaxId: 562]
Probab=97.57  E-value=0.00038  Score=55.60  Aligned_cols=124  Identities=13%  Similarity=0.106  Sum_probs=85.6

Q ss_pred             HHHHHHHHhh---hccccEEEEEECCEEEEeeeecCCC--CCCh-HHHHHHHHHhhhhcccccCCCcccccCCccccccc
Q 023161          146 LAWATYVLLR---NTNSISVLISIRGELCVRGYWQTPD--GASK-TQLLEWFERQIENIGLSDLKDSLYFPQSADAGQWE  219 (286)
Q Consensus       146 LAWaS~~LLt---nT~a~svlV~~~g~ll~RG~~~~p~--~~~~-g~I~~~~~~~~~~~~Lv~L~~t~lyPgr~E~~~~~  219 (286)
                      |+-.+..|.+   ++...++.+..++.....-+.+...  .+.. .-++-++.++.++...-|..+   .|..      -
T Consensus        27 L~~i~~~l~~~~~~~~~~~~yl~d~~~~~l~~~~~~~~~~~i~~g~g~~g~v~~~~~~~~v~d~~~---d~~~------~   97 (159)
T d1vhma_          27 LANTSALLYERLTDINWAGFYLLEDDTLVLGPFQGKIACVRIPVGRGVCGTAVARNQVQRIEDVHV---FDGH------I   97 (159)
T ss_dssp             HHHHHHHHHHHSSSCSEEEEEEEETTEEEEEEEEESCCCSEEETTSHHHHHHHHHTSCEEESCTTT---CTTC------C
T ss_pred             HHHHHHHHHHhhccCceEEEEEEECCEeeeeeecCCcceeEEecCCChHHHHHHcCCcEEecChHH---hhhh------h
Confidence            5566655533   4455566666666644332222222  2222 337788888888887766632   2332      3


Q ss_pred             cCCCCCceeEEEeccCCCCCCcccccccceEEEEeeCCccccchhhHHHHHHHHhhhccccceeecC
Q 023161          220 MLPKGTCSVFVQPVIQAPNPSAVEVEKIEGFVLLASSMTYAYSHKDRAWIKAVSNKFRGKSIWIDLK  286 (286)
Q Consensus       220 ~LP~ntqsVlvQPig~~~~~~~~~~~~~~Gvlvlgs~s~RafT~kDe~WI~~iA~KL~~~~~~~~~~  286 (286)
                      ....+.+|++|.||..+        .+.-|+|.+++..++.||+.|...++.+|+.+..+++..|++
T Consensus        98 ~~~~~~~S~l~vPl~~~--------~~viGvl~v~s~~~~~F~~~d~~ll~~la~qia~ai~~~~~~  156 (159)
T d1vhma_          98 ACDAASNSEIVLPLVVK--------NQIIGVLDIDSTVFGRFTDEDEQGLRQLVAQLEKVLATTDYK  156 (159)
T ss_dssp             CSCCCCSEEEEEEEEET--------TEEEEEEEEEESSTTCCCHHHHHHHHHHHHHHHHHHHTSSGG
T ss_pred             hcccccceeEEeCEEEC--------CEEEEEEEecCCCCCCCCHHHHHHHHHHHHHHHHHHHccchh
Confidence            34678999999999652        244799999999999999999999999999999999888764



>d1f5ma_ d.110.2.1 (A:) Hypothetical protein ykl069wp {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1mc0a2 d.110.2.1 (A:402-555) 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF A and GAF B domains {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d1mc0a1 d.110.2.1 (A:215-401) 3',5'-cyclic nucleotide phosphodiesterase 2A, GAF A and GAF B domains {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure