Citrus Sinensis ID: 026171


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240--
MMTSNQLTAIARGEANTSFAFSTRNTVEAKWVRQIKMSSPNPNVKKANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQQKFEEFA
ccccccccEEEcccccccccccccccEEEEEEEEHHccccccccccccEEEEEEEEEEEEcccccccccccccccccHHHHHccccEEEEEEccccccccccccccEEEEEcccccEEEEEEEEEEEEEEEEccccEEEEEEEEEEEcccccccccccEEEEEEEEEEEEEcccccEEEEEEEEEEEEEEEcccccccccHHHHcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccc
cccccHHHHHHccccccEccccccccEEEEEEEEEEEccccccccEEEEEEEcccEEEEEEccccccccccccccccHHHHHcccccEEEEccHHHHHHHccccccEEEEEEccEEEEEEEEEEEEEEEEEEccccEEEEEEEEEEcccHHHHHHHHHcEEEEEEEEEccccccccEEEEEEEEEEEEEccccccccccHHHHHccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
MMTSNQLTAIARgeantsfafstrnTVEAKWVRQIkmsspnpnvkkanLSATKKerielpinydydgncgskakAYHIseflshpsgIQAMLNTSALQsfqfldtntyrcvlpkvqflnfeaapvmdlrvtptdkdctvqllsckfegsdivesQNDRFSAFMINHMtwntndsesFLEVEVKLKLCLEIytrpfsllpisaverpgnlMMQALVDRLVPLLLRQLLQDYDKWVQQKFEEFA
MMTSNQLtaiargeantsfafstrnTVEAKWVrqikmsspnpnvkkanlsatkkerieLPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQQKFEEFA
MMTSNQLTAIARGEANTSFAFSTRNTVEAKWVRQIKMSSPNPNVKKANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFlevevklklcleIYTRPFSLLPISAVERPGNLMMQAlvdrlvplllrqllqdYDKWVQQKFEEFA
******************FAFSTRNTVEAKWVR***********************IELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCK***************AFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQQ******
**************AN*S**F******************************************************YHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKW**Q******
MMTSNQLTAIARGEANTSFAFSTRNTVEAKWVRQIKMSSPNPNVKKANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQQKFEEFA
****NQLTAIARGEANTSFAFSTRNTVEAKWVRQIKMSSPNPNVKKANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQQKFE***
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MMTSNQLTAIARGEANTSFAFSTRNTVEAKWVRQIKMSSPNPNVKKANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQQKFEEFA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query242
356561899244 PREDICTED: uncharacterized protein LOC10 0.805 0.799 0.676 4e-73
356541352211 PREDICTED: uncharacterized protein LOC10 0.805 0.924 0.671 1e-72
356554381211 PREDICTED: uncharacterized protein LOC10 0.805 0.924 0.669 1e-71
255558642224 conserved hypothetical protein [Ricinus 0.871 0.941 0.614 5e-71
440583714292 hypothetical protein [Lupinus angustifol 0.838 0.695 0.586 8e-71
449441974241 PREDICTED: uncharacterized protein LOC10 0.938 0.941 0.573 3e-68
224082312194 predicted protein [Populus trichocarpa] 0.760 0.948 0.695 8e-65
296081822232 unnamed protein product [Vitis vinifera] 0.909 0.948 0.589 3e-62
294464841311 unknown [Picea sitchensis] 0.776 0.604 0.577 2e-58
225429892232 PREDICTED: uncharacterized protein LOC10 0.768 0.801 0.635 3e-58
>gi|356561899|ref|XP_003549214.1| PREDICTED: uncharacterized protein LOC100813897 [Glycine max] Back     alignment and taxonomy information
 Score =  280 bits (716), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 136/201 (67%), Positives = 164/201 (81%), Gaps = 6/201 (2%)

Query: 38  SSPNPNVKK-ANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSA 96
           +S N N KK ANL A KKERI LP    Y+ + G   K YHISEFLS PSGI A+LNT A
Sbjct: 42  TSLNSNAKKKANLFAEKKERIRLPT---YNDDLG--GKKYHISEFLSQPSGIAAVLNTKA 96

Query: 97  LQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQN 156
           LQSF+ LD NTYRC LPK+QFLNFEAAP++DLRVT TD+DC V++LSCKFEGS++V+ QN
Sbjct: 97  LQSFESLDANTYRCELPKLQFLNFEAAPLLDLRVTSTDEDCLVEMLSCKFEGSEVVKEQN 156

Query: 157 DRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVD 216
           D FSAFM N MTW    +ESFLEV+VKL L LEIYT+PF+++P SAVE PGN+MMQALVD
Sbjct: 157 DHFSAFMRNQMTWGGAGAESFLEVDVKLNLTLEIYTQPFTMMPTSAVEGPGNIMMQALVD 216

Query: 217 RLVPLLLRQLLQDYDKWVQQK 237
           +LVPLLL+Q+LQDYD+WV+++
Sbjct: 217 KLVPLLLQQMLQDYDEWVRKQ 237




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356541352|ref|XP_003539141.1| PREDICTED: uncharacterized protein LOC100803285 [Glycine max] Back     alignment and taxonomy information
>gi|356554381|ref|XP_003545525.1| PREDICTED: uncharacterized protein LOC100793446 [Glycine max] Back     alignment and taxonomy information
>gi|255558642|ref|XP_002520346.1| conserved hypothetical protein [Ricinus communis] gi|223540565|gb|EEF42132.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|440583714|emb|CCH47217.1| hypothetical protein [Lupinus angustifolius] Back     alignment and taxonomy information
>gi|449441974|ref|XP_004138757.1| PREDICTED: uncharacterized protein LOC101204116 [Cucumis sativus] gi|449499567|ref|XP_004160852.1| PREDICTED: uncharacterized protein LOC101226750 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224082312|ref|XP_002306644.1| predicted protein [Populus trichocarpa] gi|222856093|gb|EEE93640.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|296081822|emb|CBI20827.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|294464841|gb|ADE77926.1| unknown [Picea sitchensis] Back     alignment and taxonomy information
>gi|225429892|ref|XP_002281057.1| PREDICTED: uncharacterized protein LOC100250928 [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query242
TAIR|locus:1005716370274 AT4G31115 [Arabidopsis thalian 0.768 0.678 0.477 3.1e-44
TAIR|locus:2184377255 AT5G04440 "AT5G04440" [Arabido 0.533 0.505 0.305 2.6e-15
TAIR|locus:1005716370 AT4G31115 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 466 (169.1 bits), Expect = 3.1e-44, P = 3.1e-44
 Identities = 94/197 (47%), Positives = 129/197 (65%)

Query:    43 NVKKANLSATKKERIELPINYDYDGNCGSKAKAYHISEFLSHPSGIQAMLNTSALQSFQF 102
             ++KKAN+SA++K+RI+L IN       G K   +  SEFL HPSG++A++N  ALQS+  
Sbjct:    78 SLKKANISASRKQRIKLEIN-------GEKELTF--SEFLKHPSGMEAVINAKALQSYHL 128

Query:   103 LDTN--TYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQNDRFS 160
             +D +  TYRC LPKVQ ++FE  PV+ LRVTPT +DCTV+LLSCK EGS+++E+Q++RFS
Sbjct:   129 VDDSDDTYRCTLPKVQLMSFEVYPVLVLRVTPTQEDCTVELLSCKLEGSELLENQSERFS 188

Query:   161 AFMINHMTWNTNDSESFXXXXXXXXXXXXIYTRPFSLLPISAVERPGNLMMQAXXXXXXX 220
             A M N MTWN    E F            I TRPF++LP+SAVE PGNL+MQ        
Sbjct:   189 AIMTNCMTWNMEHPEPFLEVDVRLNVTLEISTRPFTMLPVSAVEAPGNLVMQTLVDTLVP 248

Query:   221 XXXXXXXXXYDKWVQQK 237
                      YD+W++++
Sbjct:   249 LLLQQLLKDYDEWIKKQ 265




GO:0005634 "nucleus" evidence=ISM
TAIR|locus:2184377 AT5G04440 "AT5G04440" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query242
pfam09366170 pfam09366, DUF1997, Protein of unknown function (D 1e-39
>gnl|CDD|117908 pfam09366, DUF1997, Protein of unknown function (DUF1997) Back     alignment and domain information
 Score =  134 bits (340), Expect = 1e-39
 Identities = 46/159 (28%), Positives = 79/159 (49%), Gaps = 8/159 (5%)

Query: 78  ISEFLSHPSG-IQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKD 136
           ++E+L  P     A+L+   ++  + L    YR  +    F  FE  PV+ LRV P D  
Sbjct: 18  LAEYLRQPQRVFSALLD--PMKVER-LGDGRYRLTVRPFGFFGFEVEPVVVLRVEPEDDG 74

Query: 137 CTVQLLSCKFEGSDIVESQNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFS 196
            T++LL C+ EG  +V   ND F   +   +  +   +E+ LE +  L + + +   P  
Sbjct: 75  LTIELLDCELEGLPLV---NDDFDLDLRASLYPDREATETGLEGDADLSVTVSL-PPPLR 130

Query: 197 LLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWVQ 235
           LLP   +E  G  ++Q ++ ++   L RQLL+D+  W+ 
Sbjct: 131 LLPEPVLESTGESLLQGILRQIKRRLTRQLLEDFHAWIA 169


This family of proteins are functionally uncharacterized. Length = 170

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 242
PF09366158 DUF1997: Protein of unknown function (DUF1997); In 100.0
PF06240140 COXG: Carbon monoxide dehydrogenase subunit G (Cox 97.86
cd07823146 SRPBCC_5 Ligand-binding SRPBCC domain of an unchar 97.26
cd05018144 CoxG Carbon monoxide dehydrogenase subunit G (CoxG 96.46
COG3427146 Carbon monoxide dehydrogenase subunit G, CoxG [Ene 94.51
>PF09366 DUF1997: Protein of unknown function (DUF1997); InterPro: IPR018971 This family of proteins are functionally uncharacterised Back     alignment and domain information
Probab=100.00  E-value=7.4e-46  Score=310.59  Aligned_cols=156  Identities=35%  Similarity=0.611  Sum_probs=151.6

Q ss_pred             cchHHHhcCCCce-EEecCCCcCcceEEccCCeEEEEecCceeEeEEEEEEEEEEEEEcCCCCeEEEEeeEEeCCccccc
Q 026171           76 YHISEFLSHPSGI-QAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVES  154 (242)
Q Consensus        76 ~~l~eYL~~p~r~-~alldp~~~~rie~Lgd~~fRl~l~~~~ff~~eVePvv~lrV~~~~~~~~v~l~sc~l~G~~~v~~  154 (242)
                      ++|++||++|+|| ++|+|++   ++|+||||+|||+|+|++||+|+|+|+|+|+|++++++|.+++.+|+|+|++.+++
T Consensus         2 ~~l~~YL~~~~r~~~~~~d~~---~ie~l~~~~yr~~~~~~~~~~~~v~P~v~l~v~~~~~~~~i~~~~~~l~G~~~~~~   78 (158)
T PF09366_consen    2 APLAEYLSDPQRWFSALFDPM---RIEPLGDNTYRLKMRPFQFFGFEVEPVVDLRVWPQDDGLTIRSLDCELRGSPLVEQ   78 (158)
T ss_pred             CchHHHHhCchhHHHHhcCHH---HcEEcCCCeEEEEEcCccEEEEEEEEEEEEEEEEcCCCeEEEEEEEEEeCCCcccc
Confidence            6899999999999 9999997   99999999999999999999999999999999999999999999999999998877


Q ss_pred             cccceeeeEEEEEEEeecCCCceEEEEEEEEEEEEEeCCcceecchhhhhchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 026171          155 QNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDYDKWV  234 (242)
Q Consensus       155 ~n~~F~l~m~~~L~~~~~~~~~~L~g~~~L~V~v~l~P~pf~llP~~lle~tGn~vL~~IL~~i~~rl~~qL~~DY~~W~  234 (242)
                       |.+|+++++|.|+|.+.++.+.++|+++|+|++++ |++|+++|++++|+|||+||++|+++|++||+++|++||++|+
T Consensus        79 -~~~f~l~~~~~l~~~~~~~~t~l~~~~~l~V~v~~-P~~~~~~P~~~l~~~G~~vl~~il~~i~~r~~~~l~~Dy~~w~  156 (158)
T PF09366_consen   79 -NDGFSLDLQASLYPEEPPGRTRLEGDADLSVSVEL-PPPFRLLPESLLESTGNAVLQQILRQIKPRFLQQLQADYHRWA  156 (158)
T ss_pred             -CCcEEEEEEEEEEEecCCCceEEEEEEEEEEEEEc-ChhHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence             99999999999999998899999999999999999 9999999999999999999999999999999999999999998


Q ss_pred             Hh
Q 026171          235 QQ  236 (242)
Q Consensus       235 ~~  236 (242)
                      ++
T Consensus       157 ~~  158 (158)
T PF09366_consen  157 RE  158 (158)
T ss_pred             hC
Confidence            74



>PF06240 COXG: Carbon monoxide dehydrogenase subunit G (CoxG); InterPro: IPR010419 The CO dehydrogenase structural genes coxMSL are flanked by nine accessory genes arranged as the cox gene cluster Back     alignment and domain information
>cd07823 SRPBCC_5 Ligand-binding SRPBCC domain of an uncharacterized subfamily of proteins Back     alignment and domain information
>cd05018 CoxG Carbon monoxide dehydrogenase subunit G (CoxG) Back     alignment and domain information
>COG3427 Carbon monoxide dehydrogenase subunit G, CoxG [Energy production and conversion] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query242
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 43.7 bits (102), Expect = 4e-05
 Identities = 41/266 (15%), Positives = 79/266 (29%), Gaps = 71/266 (26%)

Query: 16  NTSFAFSTRNTVEAKWVRQIKMS----SPNPNVK--------KANL--SATKKERIELP- 60
           N +  F+  N    +   +++ +     P  NV         K  +        +++   
Sbjct: 121 NDNQVFAKYNVSRLQPYLKLRQALLELRPAKNVLIDGVLGSGKTWVALDVCLSYKVQCKM 180

Query: 61  ------INYDYDGNCGSKAK--------AYHI----SEFLSHPSGIQAMLNTSALQSFQF 102
                 +N     NC S            Y I    +    H S I+  +++   +  + 
Sbjct: 181 DFKIFWLNL---KNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQAELRRL 237

Query: 103 LDTNTY-RC--VLPKVQFLNFEAAPVMDLR----VTPTDKDCTVQLLSCKFEGSDIVESQ 155
           L +  Y  C  VL  VQ  N +A    +L     +T   K  T           D +   
Sbjct: 238 LKSKPYENCLLVLLNVQ--NAKAWNAFNLSCKILLTTRFKQVT-----------DFL--S 282

Query: 156 NDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALV 215
               +   ++H +      E    +   L   L+        LP   +    N    +++
Sbjct: 283 AATTTHISLDHHSMTLTPDEV-KSL---LLKYLDC---RPQDLPREVLT--TNPRRLSII 333

Query: 216 DRLVPLLLRQLLQDYDKWVQQKFEEF 241
                  +R  L  +D W     ++ 
Sbjct: 334 ----AESIRDGLATWDNWKHVNCDKL 355


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query242
3p9v_A161 Uncharacterized protein; structural genomics, PSI- 95.79
2ns9_A157 Hypothetical protein APE2225; uncharacterized cons 93.1
2pcs_A162 Conserved protein; structural genomics, unknown fu 92.25
3ggn_A155 Uncharacterized protein DR_A0006; structural genom 81.28
>3p9v_A Uncharacterized protein; structural genomics, PSI-biology, protein structure initiati northeast structural genomics consortium, NESG; 1.78A {Marinobacter aquaeolei} Back     alignment and structure
Probab=95.79  E-value=0.15  Score=40.77  Aligned_cols=134  Identities=14%  Similarity=0.186  Sum_probs=73.4

Q ss_pred             cchHHHhcCCCceEEecCCCcCcceEEccCCeEEEEecCceeEeEEEEEEEEEEEEEcCCCCeEEEEeeEEeCCcccccc
Q 026171           76 YHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLPKVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVESQ  155 (242)
Q Consensus        76 ~~l~eYL~~p~r~~alldp~~~~rie~Lgd~~fRl~l~~~~ff~~eVePvv~lrV~~~~~~~~v~l~sc~l~G~~~v~~~  155 (242)
                      ..+=+++.|.++|-++.+.=  +-.+..+++.|+..+..+++-+..+.=....++...+++..|.++..  .|       
T Consensus        20 e~v~~lv~Dve~w~~~yp~~--~p~~~~~~~~~~~~~a~l~vg~~~~~~~~~s~~~~~~~~~~I~~~~~--~g-------   88 (161)
T 3p9v_A           20 DEVFDLLADVPKSASHFPKV--DKLVDLGNNAYRWEMEKVGVDKHAIQSVYACTYHADKEAGKITWSPI--KG-------   88 (161)
T ss_dssp             HHHHHHHTCHHHHHTTSTTE--EEEEEEETTEEEEEECCBSSGGGCBCCEEEEEEEEETTTTEEEEEEC--TT-------
T ss_pred             HHHHHHHhChhhhHhhCCCe--EEEEEcCCCceEEEEEEEeeeeEEEEEEEEEEEEEcCCCCEEEEEEe--cC-------
Confidence            66889999999975555432  34467788877776644443322222222334444343445554322  12       


Q ss_pred             ccceeeeEEEEEEEeecCCCceEEEEEEEEEEEEEeCCcceecchhhhhchHHHHHHHHHHHHHHHHHHHHHHHH
Q 026171          156 NDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLLRQLLQDY  230 (242)
Q Consensus       156 n~~F~l~m~~~L~~~~~~~~~~L~g~~~L~V~v~l~P~pf~llP~~lle~tGn~vL~~IL~~i~~rl~~qL~~DY  230 (242)
                        .|...+.|.-+....++++   ..+++.+.+++ +.|+.    ++++.+...+++.+.++|...|...+.+=|
T Consensus        89 --~~~~~l~g~w~f~p~~~g~---t~V~~~~~~e~-~~pl~----~ll~~~~~~~~~~~~~~~v~af~~ra~~~~  153 (161)
T 3p9v_A           89 --EGNGVVSGSWTLSAKGDNA---TAVKFQTSAEL-TVPLP----SLLKLAISPVIKHEFNSLVDTYMANLKKAF  153 (161)
T ss_dssp             --STTEEEEEEEEEEESSSSC---EEEEEEEEEEE-EECSC----GGGHHHHHHHHHHHHHHHHHHHHHHHHTTT
T ss_pred             --ccceeEEEEEEEEECCCCe---EEEEEEEEEEE-cCcch----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence              3333456655554433332   23334444444 22222    556777888888888888888877765433



>2ns9_A Hypothetical protein APE2225; uncharacterized conserved protein, structural genomics, PSI, structure initiative; 1.80A {Aeropyrum pernix} SCOP: d.129.3.10 Back     alignment and structure
>2pcs_A Conserved protein; structural genomics, unknown function, ligand binding, PSI-2 structure initiative; HET: UNL; 2.40A {Geobacillus kaustophilus} SCOP: d.129.3.10 Back     alignment and structure
>3ggn_A Uncharacterized protein DR_A0006; structural genomics, PSI- 2, protein structure initiative, northeast structural genomics consortium; 2.00A {Deinococcus radiodurans R1} PDB: 2kcz_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query242
d2pcsa1147 Hypothetical protein GKP20 {Geobacillus kaustophil 96.68
d2ns9a1147 Hypothetical protein APE2225 {Aeropyrum pernix [Ta 95.08
d2d4ra1146 Hypothetical protein TTHA0849 {Thermus thermophilu 83.47
>d2pcsa1 d.129.3.10 (A:1-147) Hypothetical protein GKP20 {Geobacillus kaustophilus [TaxId: 1462]} Back     information, alignment and structure
class: Alpha and beta proteins (a+b)
fold: TBP-like
superfamily: Bet v1-like
family: CoxG-like
domain: Hypothetical protein GKP20
species: Geobacillus kaustophilus [TaxId: 1462]
Probab=96.68  E-value=0.027  Score=41.23  Aligned_cols=131  Identities=11%  Similarity=0.144  Sum_probs=74.3

Q ss_pred             cchHHHhcCCCceEEecCCCcCcceEEccCCeEEEEec-CceeEeEEEEEEEEEEEEEcCCCCeEEEEeeEEeCCccccc
Q 026171           76 YHISEFLSHPSGIQAMLNTSALQSFQFLDTNTYRCVLP-KVQFLNFEAAPVMDLRVTPTDKDCTVQLLSCKFEGSDIVES  154 (242)
Q Consensus        76 ~~l~eYL~~p~r~~alldp~~~~rie~Lgd~~fRl~l~-~~~ff~~eVePvv~lrV~~~~~~~~v~l~sc~l~G~~~v~~  154 (242)
                      ..+=+.|.||++.-.|++.=  ..++.+++++|..++. +++.+..+..=.+.+.-.. . ...+........+.+    
T Consensus        14 e~v~~~l~D~~~~~~~~Pg~--~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~----   85 (147)
T d2pcsa1          14 EEVWSKLMDPSILSKCIMGC--KSLELIGEDKYKADLQIGIAAVKGKYDAIIEVTDIK-P-PYHYKLLVNGEGGPG----   85 (147)
T ss_dssp             HHHHHHHTCHHHHHHHSTTE--EEEEEEETTEEEEEEEECCGGGCEEEEEEEEEEEEE-T-TTEEEEEEEEEETTE----
T ss_pred             HHHHHHHcCHHHHHhhCcch--hhceecCCCEEEEEEEEeecceeeeeEEEEEEeccC-c-ccceeEeeccccccc----
Confidence            67899999999998888754  4799999999998877 4666555554333333333 1 212222211111111    


Q ss_pred             cccceeeeEEEEEEEeecCCCceEEEEEEEEEEEEEeCCcceecchhhhhchHHHHHHHHHHHHHHHHH
Q 026171          155 QNDRFSAFMINHMTWNTNDSESFLEVEVKLKLCLEIYTRPFSLLPISAVERPGNLMMQALVDRLVPLLL  223 (242)
Q Consensus       155 ~n~~F~l~m~~~L~~~~~~~~~~L~g~~~L~V~v~l~P~pf~llP~~lle~tGn~vL~~IL~~i~~rl~  223 (242)
                         ...+...-.+.. ..++.+    .+++++.+++ ..++..+...+++++.+.++++.+.++..++.
T Consensus        86 ---~~~~~~~~~~~~-~~~~~T----~v~~~~~~~~-~g~l~~l~~~li~~~~~~~~~~f~~~~~~~l~  145 (147)
T d2pcsa1          86 ---FVNAEGVIDLTP-INDECT----QLTYTYSAEV-GGKVAAIGQRMLGGVAKLLISDFFKKIQKEIA  145 (147)
T ss_dssp             ---EEEEEEEEEEEE-SSSSEE----EEEEEEEEEE-ESGGGGGCHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             ---hhcceeEEeeec-ccCCcE----EEEEEEEEEE-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence               111111111111 122333    3444445566 56677777777777777777777777776654



>d2ns9a1 d.129.3.10 (A:10-156) Hypothetical protein APE2225 {Aeropyrum pernix [TaxId: 56636]} Back     information, alignment and structure
>d2d4ra1 d.129.3.6 (A:2-147) Hypothetical protein TTHA0849 {Thermus thermophilus [TaxId: 274]} Back     information, alignment and structure