Citrus Sinensis ID: 020416


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320------
MAAAALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHcccccccccccccccccccccccccccEEEEcccccccccccccEEEEEEEEEccccEEEEEccccccccHHHHHHHHHHHHHHcccccccEEEEEHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccEEEEEcccHHHHHHHHccccccccccccccccccccccccccccEEEEEccccHHHHHHcccccccEEEcHHHHHHHHHHHccccc
cccEEEEcccccccccccccccccccccccccccccccccccccccccccEccccccccHHHcccHHHHccccHHcccccccccEEEEccccccccccEEEEEEccccccccccccEEEEEEEEcccccEEEEEEcccccccHHHHHHHHHHHHHHcccccccEEEEccHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccHHHccccEEEEEEcHHHHHHHccccccccccccccccHHccccccccccccEEEEEcccHHHHHHHHcccccHEEEHHHHHHHHHHHHccccc
MAAAAlslnnststnsptlnshkpiskftsltkptnvsfnfltntpprlqhfrprpsvsesslsvpkeadaeieaddveddptqelsyldeetdpesiteweldfcsrpildirgKKIWELVVCdgslslqytkyfpnnvinsITLKEAIVAICddlgvpipekIRFFRSQMQTIITKACkeldikpipskrCLSLLLWLEERYETvytrhpgfqkgskpllaldnpfpmelpdnlfgdkwafVQLPFSAVQEEVSSLESKfvfgasldldllgievddktlipglavassrakplagalplssILYISAIQFILFCLFMVFFPPL
maaaalslnnststnsptlnshkpiSKFTSLTKPTNVSFNFLTNTPPRLQHFrprpsvsesslsvpkeadaeieaddveddpTQELSyldeetdpesiteweldFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKackeldikpipskRCLSLLLWLEERYETVYTrhpgfqkgskpLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
MaaaalslnnststnsptlnsHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKeadaeieaddveddPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
*************************************************************************************************ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFF***
****************************************************************************************LDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPG*************PFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
********************SHKPISKFTSLTKPTNVSFNFLTNTPPRLQHF******************AEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
**********STST*************F*SLTKPTNVSFNFL*************************************DDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHiii
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHHHHHHiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooHHHHHHHHHHHHHHHHHHHHiii
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAAAALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGALPLSSILYISAIQFILFCLFMVFFPPL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query326
225450083378 PREDICTED: uncharacterized protein LOC10 0.892 0.769 0.77 1e-118
255553548377 conserved hypothetical protein [Ricinus 0.886 0.766 0.748 1e-118
118489335376 unknown [Populus trichocarpa x Populus d 0.914 0.792 0.690 1e-114
449436313379 PREDICTED: uncharacterized protein LOC10 0.834 0.717 0.742 1e-114
224104083325 predicted protein [Populus trichocarpa] 0.748 0.750 0.791 1e-111
388502160382 unknown [Lotus japonicus] 0.904 0.772 0.656 1e-111
224104081325 predicted protein [Populus trichocarpa] 0.711 0.713 0.818 1e-110
297829368374 hypothetical protein ARALYDRAFT_478142 [ 0.874 0.762 0.707 1e-110
18398129374 RNA binding protein [Arabidopsis thalian 0.874 0.762 0.704 1e-109
297736276298 unnamed protein product [Vitis vinifera] 0.659 0.721 0.860 1e-106
>gi|225450083|ref|XP_002278058.1| PREDICTED: uncharacterized protein LOC100243060 [Vitis vinifera] Back     alignment and taxonomy information
 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 231/300 (77%), Positives = 254/300 (84%), Gaps = 9/300 (3%)

Query: 4   AALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTN---TPPRLQHFRPRPSVSE 60
           A LSLN  T   +PTL SHKPI +F SLT PT     F TN   T P+L HFR   SVSE
Sbjct: 2   AGLSLN-PTKITTPTLQSHKPIYRFNSLTNPTKTQLKFPTNPAKTHPKLLHFR-HSSVSE 59

Query: 61  SSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWE 120
           SS+SVPKE + + E DD    PT E++YLD ETDPESI+EWELDFCSRPILDIRGKKIWE
Sbjct: 60  SSVSVPKEVEVDDEEDD----PTSEMNYLDRETDPESISEWELDFCSRPILDIRGKKIWE 115

Query: 121 LVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC 180
           L+VCD SLSLQYTKYFPNNVINS+TLK AI +I D+L VP+PEKIRFFRSQMQTI+TKAC
Sbjct: 116 LLVCDSSLSLQYTKYFPNNVINSVTLKNAIESISDELDVPLPEKIRFFRSQMQTIVTKAC 175

Query: 181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDK 240
           KEL IKPIPSKRCLSL+LWLEERYETVYTRHPGFQ+GSKPLL LDNPFPM+LP+NLFG+K
Sbjct: 176 KELGIKPIPSKRCLSLILWLEERYETVYTRHPGFQQGSKPLLTLDNPFPMQLPENLFGEK 235

Query: 241 WAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDDKTLIPGLAVASSRAKPLAGAL 300
           WAFVQLPFSAVQEEVSSLE++ VFGASLDLDLLGIEVD  TLIPGLAVASSRAKPLA  +
Sbjct: 236 WAFVQLPFSAVQEEVSSLETRLVFGASLDLDLLGIEVDANTLIPGLAVASSRAKPLAAWM 295




Source: Vitis vinifera

Species: Vitis vinifera

Genus: Vitis

Family: Vitaceae

Order: Vitales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255553548|ref|XP_002517815.1| conserved hypothetical protein [Ricinus communis] gi|223543087|gb|EEF44622.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|118489335|gb|ABK96472.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|449436313|ref|XP_004135937.1| PREDICTED: uncharacterized protein LOC101208052 [Cucumis sativus] gi|449488836|ref|XP_004158187.1| PREDICTED: uncharacterized protein LOC101230638 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224104083|ref|XP_002313311.1| predicted protein [Populus trichocarpa] gi|222849719|gb|EEE87266.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|388502160|gb|AFK39146.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|224104081|ref|XP_002313310.1| predicted protein [Populus trichocarpa] gi|222849718|gb|EEE87265.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|297829368|ref|XP_002882566.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp. lyrata] gi|297328406|gb|EFH58825.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|18398129|ref|NP_566327.1| RNA binding protein [Arabidopsis thaliana] gi|6648213|gb|AAF21211.1|AC013483_35 unknown protein [Arabidopsis thaliana] gi|18252181|gb|AAL61923.1| unknown protein [Arabidopsis thaliana] gi|24899681|gb|AAN65055.1| unknown protein [Arabidopsis thaliana] gi|332641109|gb|AEE74630.1| RNA binding protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|297736276|emb|CBI24914.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query326
TAIR|locus:2077432374 ATAB2 [Arabidopsis thaliana (t 0.828 0.721 0.721 1.5e-99
TAIR|locus:2077432 ATAB2 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 988 (352.9 bits), Expect = 1.5e-99, P = 1.5e-99
 Identities = 197/273 (72%), Positives = 219/273 (80%)

Query:    25 ISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKXXXXXXXXXXXXXXPTQ 84
             I K +S TKP      F + T  +   F  R S+ ESSLS+ K              PT 
Sbjct:    19 IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQEVANEVEEDD--PTS 75

Query:    85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
             ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLSLQ TKYFPNNVINSI
Sbjct:    76 ELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLSLQVTKYFPNNVINSI 135

Query:   145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
             TLK+AIV I  DLGVP+PEKIRFFRSQMQTIITKACKEL IK +PSKRCLSL LWL+ERY
Sbjct:   136 TLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVPSKRCLSLFLWLQERY 195

Query:   205 ETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVF 264
             +TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+SAV+EE+S  + KFVF
Sbjct:   196 DTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYSAVREEISDFDEKFVF 255

Query:   265 GASLDLDLLGIEVDDKTLIPGLAVASSRAKPLA 297
             GASLDLDLLGIEVD+ TLIPGL+VA+SRAKPLA
Sbjct:   256 GASLDLDLLGIEVDENTLIPGLSVATSRAKPLA 288


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.321   0.139   0.419    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      326       292   0.00090  115 3  11 22  0.36    34
                                                     33  0.44    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  616 (65 KB)
  Total size of DFA:  216 KB (2119 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  23.47u 0.10s 23.57t   Elapsed:  00:00:01
  Total cpu time:  23.47u 0.10s 23.57t   Elapsed:  00:00:01
  Start:  Tue May 21 01:52:40 2013   End:  Tue May 21 01:52:41 2013


GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0003723 "RNA binding" evidence=IDA
GO:0006412 "translation" evidence=IMP
GO:0009658 "chloroplast organization" evidence=IMP
GO:0009704 "de-etiolation" evidence=IEP
GO:0006364 "rRNA processing" evidence=RCA
GO:0009073 "aromatic amino acid family biosynthetic process" evidence=RCA
GO:0009902 "chloroplast relocation" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0010155 "regulation of proton transport" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0016226 "iron-sulfur cluster assembly" evidence=RCA
GO:0016556 "mRNA modification" evidence=RCA
GO:0019761 "glucosinolate biosynthetic process" evidence=RCA
GO:0034660 "ncRNA metabolic process" evidence=RCA
GO:0035304 "regulation of protein dephosphorylation" evidence=RCA
GO:0042793 "transcription from plastid promoter" evidence=RCA
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA
GO:0046777 "protein autophosphorylation" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query326
pfam06485270 pfam06485, DUF1092, Protein of unknown function (D 1e-76
>gnl|CDD|219056 pfam06485, DUF1092, Protein of unknown function (DUF1092) Back     alignment and domain information
 Score =  235 bits (602), Expect = 1e-76
 Identities = 96/220 (43%), Positives = 123/220 (55%), Gaps = 12/220 (5%)

Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS-LQYTKYFPNNVINSITLKEAIVAICDDLGV 159
           WELDF SRPILD  GKK WEL++CD   S  +Y K+ P + +NS+ L  A+     + G 
Sbjct: 1   WELDFYSRPILDEDGKKRWELLICDTPRSPFRYAKFCPASEVNSVWLARALEEAIAE-GW 59

Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
             P++IRFFRSQM T+I +AC+EL I+  PS+R  +L  WLEER E VY + PG+   + 
Sbjct: 60  EKPDRIRFFRSQMLTMIQRACEELGIEVEPSRRTYALKDWLEEREEEVYPQEPGYMALAP 119

Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLESKFVFGASLDLDLLGIEVDD 279
           P +ALD P P  LP+ L GD WAF  LP   ++E     E    FG    L L    + D
Sbjct: 120 PPVALDKPPPQPLPEALRGDAWAFASLPAGDLREA---FEWPIPFGELFPLPL---GLAD 173

Query: 280 KTLIPGLAVASS-RAKPLAG---ALPLSSILYISAIQFIL 315
            T IPGL + S  RA  LA     L   S+      Q IL
Sbjct: 174 DTPIPGLRIFSGRRALALAAWLSGLEPVSLSVDGGDQLIL 213


This family consists of several hypothetical proteins of unknown function all from photosynthetic organisms including plants and cyanobacteria. Length = 270

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 326
PF06485270 DUF1092: Protein of unknown function (DUF1092); In 100.0
PF00665120 rve: Integrase core domain; InterPro: IPR001584 In 96.86
>PF06485 DUF1092: Protein of unknown function (DUF1092); InterPro: IPR009472 This family consists of several hypothetical proteins of unknown function all from photosynthetic organisms including plants and cyanobacteria Back     alignment and domain information
Probab=100.00  E-value=2.1e-90  Score=647.64  Aligned_cols=202  Identities=49%  Similarity=0.834  Sum_probs=194.6

Q ss_pred             cccccccCccccCCCceeEEEEEeeCCCceEEEecCCCCCCCHHHHHHHHHHHHHhcCCCCCCeeeeeHHHHHHHHHHHH
Q 020416          101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC  180 (326)
Q Consensus       101 WELDFYSRPilD~~GKKlWELLICDs~~~F~y~~~CP~s~VNS~WL~~AL~~ai~~~~~~~P~~IRfFR~QM~nmI~kAc  180 (326)
                      ||||||||||+|++|||+|||||||++++|+|+++||||+|||+||++||++|+++ |+.+|++|||||+||+|||++||
T Consensus         1 WElDFYsRPi~d~~gkk~WELLIcd~~~~f~~~~~cP~~~vNS~wL~~al~~a~~~-~~~~P~~Ir~FR~qm~~mI~~A~   79 (270)
T PF06485_consen    1 WELDFYSRPILDENGKKLWELLICDSPGSFRYAKFCPNSEVNSIWLRQALEEAIDQ-GGEKPDRIRFFRSQMLNMITKAC   79 (270)
T ss_pred             CccccccCcccCCCCCeeEEEEEEcCCCCEEEEEeCCCCccCHHHHHHHHHHHHHh-cCCCCcEEEEEhHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999996 67999999999999999999999


Q ss_pred             hhCCCccccccchHHHHHHHHHHhhhhccCCCCCCCCCCCCCCCCCCCCCCCCcccCCCeeEEeeecchhhHHHhhcccc
Q 020416          181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAVQEEVSSLES  260 (326)
Q Consensus       181 ~eLgI~v~PSRRT~aL~~WLeERy~~vYp~~pGy~~~~~~p~~ld~p~P~pLPDaL~GEkWaFvsLpa~~l~e~~~~~~~  260 (326)
                      ++|||+++||||||||++||+||+++|||+|+||++++++++++++++|+|||||||||+|+||+||+++|+|+   .+|
T Consensus        80 ~~lgI~~~pSRRt~aL~~wL~eR~~~vYp~~~gy~~~~~~~~~~~~~~P~pLPdaL~Ge~W~FvsLp~~~l~e~---~e~  156 (270)
T PF06485_consen   80 EELGIPVEPSRRTYALKQWLEEREEEVYPQEPGYQPGAPPPVALDPPPPQPLPDALRGEKWAFVSLPAGDLREA---FEW  156 (270)
T ss_pred             HHCCCceeecccHHHHHHHHHHHHHHhCcCCCCCCCCCCCccccCCCCCCCCChhhCCCceEEEEccHHHHHhh---hcc
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999998   489


Q ss_pred             ccccCccccCcccCCCCCCCCCcceeEEecC-CchhHHhccCCccccccc
Q 020416          261 KFVFGASLDLDLLGIEVDDKTLIPGLAVASS-RAKPLAGALPLSSILYIS  309 (326)
Q Consensus       261 pI~Fge~lpL~~l~lgL~~dt~IPGvvIfS~-RAl~LA~WLsgl~L~yi~  309 (326)
                      ||.||+++||+   +||++|++||||+|||+ |||+||+||+|+++.++.
T Consensus       157 ~i~fg~l~Pl~---~~l~~~~~IPGv~I~s~~Ral~LA~Wl~glEp~~L~  203 (270)
T PF06485_consen  157 PIPFGELLPLP---LGLASDTPIPGVVIFSGRRALPLAAWLSGLEPVSLN  203 (270)
T ss_pred             CccccccCCCC---CCCCcCCccceEEEecCcchhHHHHHhccCceEEEE
Confidence            99999998887   89999999999999996 999999999976655544



>PF00665 rve: Integrase core domain; InterPro: IPR001584 Integrase comprises three domains capable of folding independently and whose three-dimensional structures are known Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query326
3kks_A152 Integrase, IN; beta-strands flanked by alpha-helic 92.9
1cxq_A162 Avian sarcoma virus integrase; mixed beta-sheet su 91.46
1c0m_A238 Protein (integrase); HIV, X-RAY crystallography, p 91.34
2x78_A200 Integrase; viral protein, DNA-directed DNA polyme 90.77
1bco_A327 MUA domain II, bacteriophage MU transposase; polyn 84.73
3hph_A219 Integrase, IN; protein-protein complex, tetramer, 82.17
>3kks_A Integrase, IN; beta-strands flanked by alpha-helices, DNA binding protein; 2.20A {Bovine immunodeficiency virus} PDB: 3kkr_A Back     alignment and structure
Probab=92.90  E-value=0.58  Score=36.22  Aligned_cols=81  Identities=15%  Similarity=0.164  Sum_probs=61.2

Q ss_pred             cccccccccCccccCCCceeEEEEEeeCCCceEEEecCCCCCCCHHHHHHHHHHHHHhcCCCCCCeeeeeHHHHH--HHH
Q 020416           99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQ--TII  176 (326)
Q Consensus        99 ~iWELDFYSRPilD~~GKKlWELLICDs~~~F~y~~~CP~s~VNS~WL~~AL~~ai~~~~~~~P~~IRfFR~QM~--nmI  176 (326)
                      .+||+|+-.-      |+ .|=++|.|.-+.+.+...++  ..++..+..+|..++...|   |..|+.=|..-.  ..+
T Consensus         3 ~~w~~D~t~~------~~-~y~~~~iD~~sr~~~~~~~~--~~~~~~~~~~l~~~~~~~g---p~~i~sDnG~~f~s~~~   70 (152)
T 3kks_A            3 HLWQMDNTHW------NK-TIIWVAVETNSGLVEAQVIP--EETALQVALCILQLIQRYT---VLHLHSDNGPCFTAHRI   70 (152)
T ss_dssp             CEEEEEEEEE------TT-EEEEEEEETTTCCEEEEEES--SSCHHHHHHHHHHHHHHSC---CSEEEECSCHHHHSHHH
T ss_pred             cEEEEeeeee------CC-EEEEEEEEcCCCcEEEEEeC--CcCHHHHHHHHHHHHHHhC---CcEEecCCchHhhHHHH
Confidence            4899999643      44 48888888887788887776  5678889999999888754   899876553222  357


Q ss_pred             HHHHhhCCCcccccc
Q 020416          177 TKACKELDIKPIPSK  191 (326)
Q Consensus       177 ~kAc~eLgI~v~PSR  191 (326)
                      +..|+++||...-++
T Consensus        71 ~~~~~~~gi~~~~~~   85 (152)
T 3kks_A           71 ENLCKYLGITKTTGI   85 (152)
T ss_dssp             HHHHHHTTCEEEESS
T ss_pred             HHHHHHcCCeecccC
Confidence            888999999987665



>1cxq_A Avian sarcoma virus integrase; mixed beta-sheet surrounded by alpha-helices, transferase; HET: EPE; 1.02A {Avian sarcoma virus} SCOP: c.55.3.2 PDB: 1cxu_A* 1czb_A* 1cz9_A* 1asu_A* 1asw_A* 1asv_A 1a5v_A* 1a5w_A* 1a5x_A* 1vsf_A* 1vse_A* 3o4n_A* 1vsm_A 3o4q_A* 1vsd_A* 1vsh_A* 1vsi_A* 1vsj_A* 1vsk_A 1vsl_A Back     alignment and structure
>1c0m_A Protein (integrase); HIV, X-RAY crystallography, protein structure, transferase; 2.53A {Rous sarcoma virus} SCOP: b.34.7.1 c.55.3.2 PDB: 1c1a_A Back     alignment and structure
>2x78_A Integrase; viral protein, DNA-directed DNA polyme nucleotidyltransferase, DNA integration, aspartyl protease, recombination, transferase; 2.00A {Human spumaretrovirus} PDB: 2x6n_A 2x6s_A 2x74_A 3dlr_A Back     alignment and structure
>1bco_A MUA domain II, bacteriophage MU transposase; polynucleotidyl transferase, DNA binding, endonuclease, integrase; 2.40A {Enterobacteria phage MU} SCOP: b.48.1.1 c.55.3.3 PDB: 1bcm_A Back     alignment and structure
>3hph_A Integrase, IN; protein-protein complex, tetramer, DNA integration, endonucl magnesium, metal-binding, multifunctional enzyme; 2.64A {Maedi visna virus} PDB: 3hpg_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query326
d1cxqa_147 Retroviral integrase, catalytic domain {Rous sarco 92.85
d1exqa_154 Retroviral integrase, catalytic domain {Human immu 83.91
>d1cxqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Rous sarcoma virus RSV [TaxId: 11886]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Retroviral integrase, catalytic domain
domain: Retroviral integrase, catalytic domain
species: Rous sarcoma virus RSV [TaxId: 11886]
Probab=92.85  E-value=0.083  Score=40.62  Aligned_cols=86  Identities=16%  Similarity=0.198  Sum_probs=61.4

Q ss_pred             cccccccccCccccCCCceeEEEEEeeCCCceEEEecCCCCCCCHHHHHHHHHHHHHhcCCCCCCeeeeeHHH--HHHHH
Q 020416           99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQ--MQTII  176 (326)
Q Consensus        99 ~iWELDFYSRPilD~~GKKlWELLICDs~~~F~y~~~CP~s~VNS~WL~~AL~~ai~~~~~~~P~~IRfFR~Q--M~nmI  176 (326)
                      .+|++||-.-|.   .|.+.|=++|.|.-+.+.+.  ++....++..+..++..++...|  .|..|+.=+..  +..-+
T Consensus         8 ~~w~~D~~~~~~---~~~~~~l~v~vD~~Sr~i~~--~~~~~~t~~~~~~~l~~~~~~~g--~p~~i~sDnG~~F~s~~~   80 (147)
T d1cxqa_           8 QIWQTDFTLEPR---MAPRSWLAVTVDTASSAIVV--TQHGRVTSVAAQHHWATAIAVLG--RPKAIKTDNGSCFTSKST   80 (147)
T ss_dssp             CEEEEEEEECGG---GTTSCEEEEEEETTTCCEEE--EEESSCCHHHHHHHHHHHHHHHC--CCSEEECCSCHHHHSHHH
T ss_pred             CEEEEEEEEecC---CCCeEEEEEEEecccceecc--cccCccchHHHHHHHHHHHHHhC--CCeEEEecccccccchhh
Confidence            589999844332   24445777777766655554  55666789999999999998875  59999864422  23567


Q ss_pred             HHHHhhCCCcccccc
Q 020416          177 TKACKELDIKPIPSK  191 (326)
Q Consensus       177 ~kAc~eLgI~v~PSR  191 (326)
                      +..|+++||+...+.
T Consensus        81 ~~~~~~~gI~~~~~~   95 (147)
T d1cxqa_          81 REWLARWGIAHTTGI   95 (147)
T ss_dssp             HHHHHHHTCEEECCC
T ss_pred             hhhhhheeEEeeeec
Confidence            889999999987665



>d1exqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Human immunodeficiency virus type 1 [TaxId: 11676]} Back     information, alignment and structure