Citrus Sinensis ID: 019808


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-----
MAAAALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccccccccccccccccccccccccEEEEEcccccccccccccEEEEEEEEEccccEEEEEccccccccHHHHHHHHHHHHHHcccccccEEEEEHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccEEEccccHHHHHccccEEEEEEEcccccEEEEcccccEEEEEEEccccccHHHHHHHHHHHHHccccEEEEEEcccccccEEEEEEEcccccccc
cccEEEEcccccccccccccccccccccccccccccccccccccccccccEccccccccHHHcccHHHHccccHHccccccccEEEEEccccccccccEEEEEEccccccccccccEEEEEEEEcccccEEEEEEcccccccHHHHHHHHHHHHHHcccccccEEEEccHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccHHHccccHHEEHHHHHHHHcccccEEEEEccccccEEEEcccccEEEEEEccccHHHHHHHHHHHHHHHHcccEEEEEEEcccccccccHHHHHcccccccc
MAAAAlslnnststnsptlnshkpiskftsltkptnvsfnfltntpprlqhfrprpsvsesslsvpkeadaeieaddveddptqelsyldeetdpesiteweldfcsrpildirgKKIWELVVCdgslslqytkyfpnnvinsITLKEAIVAICddlgvpipekIRFFRSQMQTIITKACkeldikpipskrCLSLLLWLEERYETvytrhpgfqkgskpllaldnpfpmelpdnlfgdkwAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYAnykknpvttsEAEAWEAAKKACGGLHFLAIqeeldsedcvgfwllldlppppv
maaaalslnnststnsptlnshkpiSKFTSLTKPTNVSFNFLTNTPPRLQHFrprpsvsesslsvpkeadaeieaddveddpTQELSyldeetdpesiteweldFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKackeldikpipskRCLSLLLWLEERYETVYTrhpgfqkgskpLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV
MaaaalslnnststnsptlnsHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKeadaeieaddveddPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSeaeaweaakkaCGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV
*************************************************************************************************ITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDL*****
********************************************************************************************TDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHP**************PFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTS********KKACGGLHFLAIQEELDSEDCVGFWLLLDLPP***
********************SHKPISKFTSLTKPTNVSFNFLTNTPPRLQHF******************AEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV
**********STST***************SLTKPTNVSFNFL*************************************DDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPP*
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAAAALSLNNSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEADAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDCVGFWLLLDLPPPPV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query335
297829368374 hypothetical protein ARALYDRAFT_478142 [ 0.955 0.855 0.636 1e-126
18398129374 RNA binding protein [Arabidopsis thalian 0.955 0.855 0.631 1e-125
255553548377 conserved hypothetical protein [Ricinus 0.937 0.832 0.667 1e-124
225450083378 PREDICTED: uncharacterized protein LOC10 0.943 0.835 0.683 1e-122
118489335376 unknown [Populus trichocarpa x Populus d 0.934 0.832 0.634 1e-119
449436313379 PREDICTED: uncharacterized protein LOC10 0.886 0.783 0.662 1e-118
388502160382 unknown [Lotus japonicus] 0.955 0.837 0.603 1e-116
224104083325 predicted protein [Populus trichocarpa] 0.773 0.796 0.698 1e-113
224104081325 predicted protein [Populus trichocarpa] 0.737 0.76 0.715 1e-112
363807199377 uncharacterized protein LOC100795572 [Gl 0.973 0.864 0.612 1e-112
>gi|297829368|ref|XP_002882566.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp. lyrata] gi|297328406|gb|EFH58825.1| hypothetical protein ARALYDRAFT_478142 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 238/374 (63%), Positives = 269/374 (71%), Gaps = 54/374 (14%)

Query: 10  NSTSTNSPTLNSHKPISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKEA 69
           N+T   +P+L     I K +S TKP      F + T  +   F  R S+ ESSLS+ KE 
Sbjct: 7   NTTRIQTPSLPR---IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQ 62

Query: 70  DAEIEADDVEDDPTQELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLS 129
           +   E +  EDDPT ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLS
Sbjct: 63  EVANEVE--EDDPTSELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLS 120

Query: 130 LQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIP 189
           LQ TKYFPNNVINSITLK+AIV I  DLGVP+PEKIRFFRSQMQTIITKACKEL IK +P
Sbjct: 121 LQVTKYFPNNVINSITLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVP 180

Query: 190 SKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS 249
           SKRCLSL LWL+ERY+TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+S
Sbjct: 181 SKRCLSLFLWLQERYDTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYS 240

Query: 250 ------------------------------------------------AWMNGLEVCSIE 261
                                                           AWMNGLEVCSIE
Sbjct: 241 AVREEISDFEEKFVFGATLDLDLLGIEVDENTLIPGLSVATSRAKPLAAWMNGLEVCSIE 300

Query: 262 TDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAWEAAKKACGGLHFLAIQEELDSEDC 321
            D+++G LILSVGI+TRY+YA YKK PVTT EAEAWE+AKKA GGLHFLAIQ++LDS+DC
Sbjct: 301 ADSSKGCLILSVGIATRYVYATYKKTPVTTDEAEAWESAKKASGGLHFLAIQDDLDSDDC 360

Query: 322 VGFWLLLDLPPPPV 335
           VGFWLL+DLPPPPV
Sbjct: 361 VGFWLLIDLPPPPV 374




Source: Arabidopsis lyrata subsp. lyrata

Species: Arabidopsis lyrata

Genus: Arabidopsis

Family: Brassicaceae

Order: Brassicales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|18398129|ref|NP_566327.1| RNA binding protein [Arabidopsis thaliana] gi|6648213|gb|AAF21211.1|AC013483_35 unknown protein [Arabidopsis thaliana] gi|18252181|gb|AAL61923.1| unknown protein [Arabidopsis thaliana] gi|24899681|gb|AAN65055.1| unknown protein [Arabidopsis thaliana] gi|332641109|gb|AEE74630.1| RNA binding protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|255553548|ref|XP_002517815.1| conserved hypothetical protein [Ricinus communis] gi|223543087|gb|EEF44622.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225450083|ref|XP_002278058.1| PREDICTED: uncharacterized protein LOC100243060 [Vitis vinifera] Back     alignment and taxonomy information
>gi|118489335|gb|ABK96472.1| unknown [Populus trichocarpa x Populus deltoides] Back     alignment and taxonomy information
>gi|449436313|ref|XP_004135937.1| PREDICTED: uncharacterized protein LOC101208052 [Cucumis sativus] gi|449488836|ref|XP_004158187.1| PREDICTED: uncharacterized protein LOC101230638 [Cucumis sativus] Back     alignment and taxonomy information
>gi|388502160|gb|AFK39146.1| unknown [Lotus japonicus] Back     alignment and taxonomy information
>gi|224104083|ref|XP_002313311.1| predicted protein [Populus trichocarpa] gi|222849719|gb|EEE87266.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224104081|ref|XP_002313310.1| predicted protein [Populus trichocarpa] gi|222849718|gb|EEE87265.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|363807199|ref|NP_001242607.1| uncharacterized protein LOC100795572 [Glycine max] gi|255640179|gb|ACU20380.1| unknown [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query335
TAIR|locus:2077432374 ATAB2 [Arabidopsis thaliana (t 0.665 0.596 0.707 6.1e-114
TAIR|locus:2077432 ATAB2 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 801 (287.0 bits), Expect = 6.1e-114, Sum P(2) = 6.1e-114
 Identities = 160/226 (70%), Positives = 176/226 (77%)

Query:    25 ISKFTSLTKPTNVSFNFLTNTPPRLQHFRPRPSVSESSLSVPKXXXXXXXXXXXXXXPTQ 84
             I K +S TKP      F + T  +   F  R S+ ESSLS+ K              PT 
Sbjct:    19 IPKPSSFTKPIKTHHLFSSETLLKRCRFVSR-SLPESSLSITKEQEVANEVEEDD--PTS 75

Query:    85 ELSYLDEETDPESITEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSI 144
             ELSYLD E+D +SI EWELDFCSRPILD RGKKIWELVVCD SLSLQ TKYFPNNVINSI
Sbjct:    76 ELSYLDPESDADSIKEWELDFCSRPILDSRGKKIWELVVCDASLSLQVTKYFPNNVINSI 135

Query:   145 TLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERY 204
             TLK+AIV I  DLGVP+PEKIRFFRSQMQTIITKACKEL IK +PSKRCLSL LWL+ERY
Sbjct:   136 TLKDAIVTITQDLGVPLPEKIRFFRSQMQTIITKACKELAIKAVPSKRCLSLFLWLQERY 195

Query:   205 ETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFSA 250
             +TVYTRHPGFQKGS PLL+LDNPFPM LP+NLFG+KWAFVQLP+SA
Sbjct:   196 DTVYTRHPGFQKGSLPLLSLDNPFPMNLPENLFGEKWAFVQLPYSA 241


GO:0009507 "chloroplast" evidence=ISM;IDA
GO:0003723 "RNA binding" evidence=IDA
GO:0006412 "translation" evidence=IMP
GO:0009658 "chloroplast organization" evidence=IMP
GO:0009704 "de-etiolation" evidence=IEP
GO:0006364 "rRNA processing" evidence=RCA
GO:0009073 "aromatic amino acid family biosynthetic process" evidence=RCA
GO:0009902 "chloroplast relocation" evidence=RCA
GO:0010027 "thylakoid membrane organization" evidence=RCA
GO:0010155 "regulation of proton transport" evidence=RCA
GO:0010207 "photosystem II assembly" evidence=RCA
GO:0016226 "iron-sulfur cluster assembly" evidence=RCA
GO:0016556 "mRNA modification" evidence=RCA
GO:0019761 "glucosinolate biosynthetic process" evidence=RCA
GO:0034660 "ncRNA metabolic process" evidence=RCA
GO:0035304 "regulation of protein dephosphorylation" evidence=RCA
GO:0042793 "transcription from plastid promoter" evidence=RCA
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA
GO:0046777 "protein autophosphorylation" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
fgenesh2_kg.3__843__AT3G08010.1
annotation not avaliable (374 aa)
(Arabidopsis lyrata)
Predicted Functional Partners:
fgenesh2_kg.8__1751__AT5G58260.1
annotation not avaliable (209 aa)
      0.666
fgenesh2_kg.7__3536__AT5G38660.1
annotation not avaliable (287 aa)
      0.661
scaffold_302117.1
annotation not avaliable (190 aa)
      0.661
scaffold_701391.1
annotation not avaliable (183 aa)
      0.648
fgenesh2_kg.8__1178__AT5G52970.1
annotation not avaliable (222 aa)
      0.647
scaffold_304439.1
annotation not avaliable (298 aa)
      0.642
fgenesh2_kg.5__1948__AT3G55330.1
annotation not avaliable (229 aa)
      0.633
fgenesh2_kg.6__2705__AT5G27560.1
annotation not avaliable (340 aa)
      0.624
fgenesh2_kg.5__435__AT3G26710.1
annotation not avaliable (267 aa)
      0.622
fgenesh2_kg.5__2683__AT3G61870.1
annotation not avaliable (271 aa)
      0.621

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query335
pfam06485270 pfam06485, DUF1092, Protein of unknown function (D 2e-91
>gnl|CDD|219056 pfam06485, DUF1092, Protein of unknown function (DUF1092) Back     alignment and domain information
 Score =  273 bits (701), Expect = 2e-91
 Identities = 109/273 (39%), Positives = 147/273 (53%), Gaps = 47/273 (17%)

Query: 101 WELDFCSRPILDIRGKKIWELVVCDGSLS-LQYTKYFPNNVINSITLKEAIVAICDDLGV 159
           WELDF SRPILD  GKK WEL++CD   S  +Y K+ P + +NS+ L  A+     + G 
Sbjct: 1   WELDFYSRPILDEDGKKRWELLICDTPRSPFRYAKFCPASEVNSVWLARALEEAIAE-GW 59

Query: 160 PIPEKIRFFRSQMQTIITKACKELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSK 219
             P++IRFFRSQM T+I +AC+EL I+  PS+R  +L  WLEER E VY + PG+   + 
Sbjct: 60  EKPDRIRFFRSQMLTMIQRACEELGIEVEPSRRTYALKDWLEEREEEVYPQEPGYMALAP 119

Query: 220 PLLALDNPFPMELPDNLFGDKWAFVQLPF------------------------------- 248
           P +ALD P P  LP+ L GD WAF  LP                                
Sbjct: 120 PPVALDKPPPQPLPEALRGDAWAFASLPAGDLREAFEWPIPFGELFPLPLGLADDTPIPG 179

Query: 249 ------------SAWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEA 296
                       +AW++GLE  S+  D     LIL  G+  R++ A+  ++P    EA+A
Sbjct: 180 LRIFSGRRALALAAWLSGLEPVSLSVDGGDQ-LILEAGLDDRWLVADL-EDPEAKQEAKA 237

Query: 297 WEAAKKACGGLHFLAIQEELDSEDCVGFWLLLD 329
           +E AK+   GLHFLA+Q + +SE   GFWLL D
Sbjct: 238 FEEAKQQAQGLHFLAVQPDPESETFAGFWLLRD 270


This family consists of several hypothetical proteins of unknown function all from photosynthetic organisms including plants and cyanobacteria. Length = 270

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 335
PF06485270 DUF1092: Protein of unknown function (DUF1092); In 100.0
PF00665120 rve: Integrase core domain; InterPro: IPR001584 In 96.8
>PF06485 DUF1092: Protein of unknown function (DUF1092); InterPro: IPR009472 This family consists of several hypothetical proteins of unknown function all from photosynthetic organisms including plants and cyanobacteria Back     alignment and domain information
Probab=100.00  E-value=2.7e-107  Score=764.64  Aligned_cols=227  Identities=52%  Similarity=0.930  Sum_probs=223.4

Q ss_pred             eecccccCccccCCCceeEEEEEeeCCCceEEEecCCCCCCCHHHHHHHHHHHHhhcCCCCCCeeeeehHHHHHHHHHHh
Q 019808          101 WELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQTIITKAC  180 (335)
Q Consensus       101 WELDFYSRPilD~~GKKlWELLICd~~~~F~y~~~CP~s~VNS~WL~~aL~~ai~~a~~~~P~~IRfFR~QM~nmI~kAc  180 (335)
                      ||||||||||+|++|||+|||||||++++|+|+++||||+|||+||++||++|+++ |+.+|++|||||+||+|||++||
T Consensus         1 WElDFYsRPi~d~~gkk~WELLIcd~~~~f~~~~~cP~~~vNS~wL~~al~~a~~~-~~~~P~~Ir~FR~qm~~mI~~A~   79 (270)
T PF06485_consen    1 WELDFYSRPILDENGKKLWELLICDSPGSFRYAKFCPNSEVNSIWLRQALEEAIDQ-GGEKPDRIRFFRSQMLNMITKAC   79 (270)
T ss_pred             CccccccCcccCCCCCeeEEEEEEcCCCCEEEEEeCCCCccCHHHHHHHHHHHHHh-cCCCCcEEEEEhHHHHHHHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999 78999999999999999999999


Q ss_pred             hhCCCccccccchHHHHHHHHHHHhhhccCCCCCCCCCCCCCCCCCCCCCCCCcccCCCeeEEEeeccc-----------
Q 019808          181 KELDIKPIPSKRCLSLLLWLEERYETVYTRHPGFQKGSKPLLALDNPFPMELPDNLFGDKWAFVQLPFS-----------  249 (335)
Q Consensus       181 ~~LgI~v~PSRRT~aL~~WL~ER~e~VYp~~pGY~~~~~~p~~~~~~~P~pLPdaL~Ge~WaFvsLpa~-----------  249 (335)
                      ++|||+++||||||||++||+||+++|||+|+||++++++++++++++|+|||||||||+|+||+||++           
T Consensus        80 ~~lgI~~~pSRRt~aL~~wL~eR~~~vYp~~~gy~~~~~~~~~~~~~~P~pLPdaL~Ge~W~FvsLp~~~l~e~~e~~i~  159 (270)
T PF06485_consen   80 EELGIPVEPSRRTYALKQWLEEREEEVYPQEPGYQPGAPPPVALDPPPPQPLPDALRGEKWAFVSLPAGDLREAFEWPIP  159 (270)
T ss_pred             HHCCCceeecccHHHHHHHHHHHHHHhCcCCCCCCCCCCCccccCCCCCCCCChhhCCCceEEEEccHHHHHhhhccCcc
Confidence            999999999999999999999999999999999999999999999999999999999999999999999           


Q ss_pred             --------------------------------cccCCcceEEEEEecCCCeEEEeccCCCeEEEEEecCCcchhHHHHHH
Q 019808          250 --------------------------------AWMNGLEVCSIETDTARGSLILSVGISTRYIYANYKKNPVTTSEAEAW  297 (335)
Q Consensus       250 --------------------------------~WlsGlEp~~L~~d~~~~~LILEtGl~DrWilAt~~d~~~~~~eA~~f  297 (335)
                                                      +||+|+|||+|+||.+..+||||||++||||||||+ ++++.+||++|
T Consensus       160 fg~l~Pl~~~l~~~~~IPGv~I~s~~Ral~LA~Wl~glEp~~L~~~~~~~~LiLEaGl~drWi~a~~~-d~~~~~ea~~f  238 (270)
T PF06485_consen  160 FGELLPLPLGLASDTPIPGVVIFSGRRALPLAAWLSGLEPVSLNYDPGEPGLILEAGLDDRWILATLE-DPEAAAEAQAF  238 (270)
T ss_pred             ccccCCCCCCCCcCCccceEEEecCcchhHHHHHhccCceEEEEEecCCceEEEecCCcceEEEEeCC-CHHHHHHHHHH
Confidence                                            999999999999998866999999999999999999 49999999999


Q ss_pred             HHHHHHcCCcEEEEEecCCCCCceeeEEeecC
Q 019808          298 EAAKKACGGLHFLAIQEELDSEDCVGFWLLLD  329 (335)
Q Consensus       298 E~aKq~a~GLHFLaVQ~dp~~etfaGFWLL~d  329 (335)
                      |++|++|+|||||||||||++++|+|||||||
T Consensus       239 e~~K~~a~GlHFLaVQ~d~~~~t~~GFWLL~~  270 (270)
T PF06485_consen  239 EQAKQAAQGLHFLAVQPDPSSETFAGFWLLQD  270 (270)
T ss_pred             HHHHHHcCCceEEEEecCCCCCceeeEEEecC
Confidence            99999999999999999999999999999996



>PF00665 rve: Integrase core domain; InterPro: IPR001584 Integrase comprises three domains capable of folding independently and whose three-dimensional structures are known Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query335
3kks_A152 Integrase, IN; beta-strands flanked by alpha-helic 92.75
2x78_A200 Integrase; viral protein, DNA-directed DNA polyme 91.36
1cxq_A162 Avian sarcoma virus integrase; mixed beta-sheet su 90.29
1c0m_A238 Protein (integrase); HIV, X-RAY crystallography, p 89.64
3hph_A219 Integrase, IN; protein-protein complex, tetramer, 81.17
1bco_A327 MUA domain II, bacteriophage MU transposase; polyn 81.06
>3kks_A Integrase, IN; beta-strands flanked by alpha-helices, DNA binding protein; 2.20A {Bovine immunodeficiency virus} PDB: 3kkr_A Back     alignment and structure
Probab=92.75  E-value=0.69  Score=36.02  Aligned_cols=81  Identities=15%  Similarity=0.164  Sum_probs=61.3

Q ss_pred             eeeecccccCccccCCCceeEEEEEeeCCCceEEEecCCCCCCCHHHHHHHHHHHHhhcCCCCCCeeeeehHHHH--HHH
Q 019808           99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQMQ--TII  176 (335)
Q Consensus        99 ~iWELDFYSRPilD~~GKKlWELLICd~~~~F~y~~~CP~s~VNS~WL~~aL~~ai~~a~~~~P~~IRfFR~QM~--nmI  176 (335)
                      .+||+||-.-      |+ .|=++|.|.-..+.+...++  ..++..+..+|..++...|   |..|+.=|..-+  ..+
T Consensus         3 ~~w~~D~t~~------~~-~y~~~~iD~~sr~~~~~~~~--~~~~~~~~~~l~~~~~~~g---p~~i~sDnG~~f~s~~~   70 (152)
T 3kks_A            3 HLWQMDNTHW------NK-TIIWVAVETNSGLVEAQVIP--EETALQVALCILQLIQRYT---VLHLHSDNGPCFTAHRI   70 (152)
T ss_dssp             CEEEEEEEEE------TT-EEEEEEEETTTCCEEEEEES--SSCHHHHHHHHHHHHHHSC---CSEEEECSCHHHHSHHH
T ss_pred             cEEEEeeeee------CC-EEEEEEEEcCCCcEEEEEeC--CcCHHHHHHHHHHHHHHhC---CcEEecCCchHhhHHHH
Confidence            4899999643      44 48888888877777887776  5678899999999999843   999987553322  357


Q ss_pred             HHHhhhCCCcccccc
Q 019808          177 TKACKELDIKPIPSK  191 (335)
Q Consensus       177 ~kAc~~LgI~v~PSR  191 (335)
                      ...|+.+||...-++
T Consensus        71 ~~~~~~~gi~~~~~~   85 (152)
T 3kks_A           71 ENLCKYLGITKTTGI   85 (152)
T ss_dssp             HHHHHHTTCEEEESS
T ss_pred             HHHHHHcCCeecccC
Confidence            888999999887664



>2x78_A Integrase; viral protein, DNA-directed DNA polyme nucleotidyltransferase, DNA integration, aspartyl protease, recombination, transferase; 2.00A {Human spumaretrovirus} PDB: 2x6n_A 2x6s_A 2x74_A 3dlr_A Back     alignment and structure
>1cxq_A Avian sarcoma virus integrase; mixed beta-sheet surrounded by alpha-helices, transferase; HET: EPE; 1.02A {Avian sarcoma virus} SCOP: c.55.3.2 PDB: 1cxu_A* 1czb_A* 1cz9_A* 1asu_A* 1asw_A* 1asv_A 1a5v_A* 1a5w_A* 1a5x_A* 1vsf_A* 1vse_A* 3o4n_A* 1vsm_A 3o4q_A* 1vsd_A* 1vsh_A* 1vsi_A* 1vsj_A* 1vsk_A 1vsl_A Back     alignment and structure
>1c0m_A Protein (integrase); HIV, X-RAY crystallography, protein structure, transferase; 2.53A {Rous sarcoma virus} SCOP: b.34.7.1 c.55.3.2 PDB: 1c1a_A Back     alignment and structure
>3hph_A Integrase, IN; protein-protein complex, tetramer, DNA integration, endonucl magnesium, metal-binding, multifunctional enzyme; 2.64A {Maedi visna virus} PDB: 3hpg_A Back     alignment and structure
>1bco_A MUA domain II, bacteriophage MU transposase; polynucleotidyl transferase, DNA binding, endonuclease, integrase; 2.40A {Enterobacteria phage MU} SCOP: b.48.1.1 c.55.3.3 PDB: 1bcm_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query335
d1cxqa_147 Retroviral integrase, catalytic domain {Rous sarco 92.42
d1exqa_154 Retroviral integrase, catalytic domain {Human immu 85.7
>d1cxqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Rous sarcoma virus RSV [TaxId: 11886]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Retroviral integrase, catalytic domain
domain: Retroviral integrase, catalytic domain
species: Rous sarcoma virus RSV [TaxId: 11886]
Probab=92.42  E-value=0.11  Score=40.10  Aligned_cols=86  Identities=16%  Similarity=0.198  Sum_probs=61.8

Q ss_pred             eeeecccccCccccCCCceeEEEEEeeCCCceEEEecCCCCCCCHHHHHHHHHHHHhhcCCCCCCeeeeehHH--HHHHH
Q 019808           99 TEWELDFCSRPILDIRGKKIWELVVCDGSLSLQYTKYFPNNVINSITLKEAIVAICDDLGVPIPEKIRFFRSQ--MQTII  176 (335)
Q Consensus        99 ~iWELDFYSRPilD~~GKKlWELLICd~~~~F~y~~~CP~s~VNS~WL~~aL~~ai~~a~~~~P~~IRfFR~Q--M~nmI  176 (335)
                      .+|++||-.-|.   .|.+.|=++|.|.-+.+.+.  +|.....+..+..+|..++...|  .|..|+.=+..  +..-+
T Consensus         8 ~~w~~D~~~~~~---~~~~~~l~v~vD~~Sr~i~~--~~~~~~t~~~~~~~l~~~~~~~g--~p~~i~sDnG~~F~s~~~   80 (147)
T d1cxqa_           8 QIWQTDFTLEPR---MAPRSWLAVTVDTASSAIVV--TQHGRVTSVAAQHHWATAIAVLG--RPKAIKTDNGSCFTSKST   80 (147)
T ss_dssp             CEEEEEEEECGG---GTTSCEEEEEEETTTCCEEE--EEESSCCHHHHHHHHHHHHHHHC--CCSEEECCSCHHHHSHHH
T ss_pred             CEEEEEEEEecC---CCCeEEEEEEEecccceecc--cccCccchHHHHHHHHHHHHHhC--CCeEEEecccccccchhh
Confidence            589999844332   24445777777765555554  55666889999999999999864  59999865532  23567


Q ss_pred             HHHhhhCCCcccccc
Q 019808          177 TKACKELDIKPIPSK  191 (335)
Q Consensus       177 ~kAc~~LgI~v~PSR  191 (335)
                      +..|+++||+...+.
T Consensus        81 ~~~~~~~gI~~~~~~   95 (147)
T d1cxqa_          81 REWLARWGIAHTTGI   95 (147)
T ss_dssp             HHHHHHHTCEEECCC
T ss_pred             hhhhhheeEEeeeec
Confidence            889999999987665



>d1exqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Human immunodeficiency virus type 1 [TaxId: 11676]} Back     information, alignment and structure