Citrus Sinensis ID: 028897


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200--
MKRPMPWGDQVDVISSDDSASDGDADNGLDGRQSSNDMTIDQPTKELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITLNSPKSPDSTKSSSPQVL
ccccccccccEEEEcccccccccccccccccccccccccccccccccHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHHcccHHHHHHHHHHHHHcccccccccccccccccccccHHHHHHHHHHHHHHcccHHHHHHHHHHcccccccccccccccccccccccccccccc
ccccccccccEEEEEccccccccccccccccccccccEEEccccccccccccccccHHHHccccccHHHHHHHHHHHHHHccccccccccccccHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHcccccccccccHHHcccHHHHHHHHHHHHHHHHccccHHHHHHHHHccccccccEcccccccccccccccccccccc
mkrpmpwgdqvdvissddsasdgdadngldgrqssndmtidqptkELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMkqlpiptqrssiipftswvglGNSIKQLYEQPLHYLTNIHLkqwdqlrfgtedehkpldslvhpckaEATVWLIEEVHRLTSCHHHLAklwlsdpmhyvfidsitlnspkspdstkssspqvl
mkrpmpwgdqvdvissddsasdgdadngldgrqssndmTIDQPTKELTSEVENiqkqkcqglnekgSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSItlnspkspdstkssspqvl
MKRPMPWgdqvdvissddsasdgdadngldgRQSSNDMTIDQPTKELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITLNspkspdstkssspQVL
*******************************************************************LFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITL*****************
*****P*******************************************************LNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSIT******************
********DQVDVISSDDSASDGDADNGLDGRQSSNDMTIDQPTKELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITLNS***************
****MPWGDQ************************SNDMT***PTK********IQKQKCQGLNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITLN****************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MKRPMPWGDQVDVISSDDSASDGDADNGLDGRQSSNDMTIDQPTKELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITLNSPKSPDSTKSSSPQVL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query202 2.2.26 [Sep-21-2011]
Q9LUJ3163 Protein RDM1 OS=Arabidops yes no 0.603 0.748 0.672 2e-46
>sp|Q9LUJ3|RDM1_ARATH Protein RDM1 OS=Arabidopsis thaliana GN=RDM1 PE=1 SV=1 Back     alignment and function desciption
 Score =  185 bits (469), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 82/122 (67%), Positives = 102/122 (83%)

Query: 62  LNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLK 121
           + ++GSL RRAEMYQ+YMKQ+PIPT R S+IPFTSWVGL  S+KQLY QPLHYLTN+ L+
Sbjct: 38  VADEGSLLRRAEMYQDYMKQVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQ 97

Query: 122 QWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFID 181
           +WDQ RFGT+ E + LDS++HP KAEAT+WL+EE+HRLT  H H+A LW SDPM++ FID
Sbjct: 98  RWDQSRFGTDSEEQRLDSIIHPTKAEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFID 157

Query: 182 SI 183
            I
Sbjct: 158 PI 159




Regulator of RNA-directed DNA methylation (RdDM). Binds to single-stranded methyl DNA.
Arabidopsis thaliana (taxid: 3702)

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query202
224099821175 predicted protein [Populus trichocarpa] 0.831 0.96 0.623 3e-57
255552097175 conserved hypothetical protein [Ricinus 0.831 0.96 0.618 4e-57
449432680174 PREDICTED: protein RDM1-like isoform 1 [ 0.821 0.954 0.604 2e-54
297737087174 unnamed protein product [Vitis vinifera] 0.826 0.959 0.586 2e-52
359477543137 PREDICTED: protein RDM1 [Vitis vinifera] 0.653 0.963 0.612 4e-47
351722067165 uncharacterized protein LOC100500129 [Gl 0.777 0.951 0.540 5e-46
297830986167 predicted protein [Arabidopsis lyrata su 0.603 0.730 0.680 2e-45
357480931 369 hypothetical protein MTR_5g006600 [Medic 0.603 0.330 0.655 6e-45
357480933179 hypothetical protein MTR_5g006610 [Medic 0.841 0.949 0.521 7e-45
30686741163 RNA-directed DNA methylation 1 protein [ 0.603 0.748 0.672 1e-44
>gi|224099821|ref|XP_002311634.1| predicted protein [Populus trichocarpa] gi|222851454|gb|EEE89001.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  227 bits (578), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 116/186 (62%), Positives = 139/186 (74%), Gaps = 18/186 (9%)

Query: 1   MKRPMPWGDQVDVISSDDSASDGDA---DNGLDGRQSSNDMTIDQPTKELTSEVENIQKQ 57
           MKR +PW D VDVISSD+S+S       ++GLDG++ SN +TIDQP KE TSE       
Sbjct: 1   MKRTIPWTDPVDVISSDESSSSDSDIELNDGLDGQRLSNYVTIDQPIKETTSE------- 53

Query: 58  KCQGLNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTN 117
                   G L RRAEMYQ+YM+ +PIP QR S IPF+SW GLG SIKQLY+QPLHYLTN
Sbjct: 54  --------GVLIRRAEMYQDYMRHIPIPVQRGSAIPFSSWAGLGKSIKQLYQQPLHYLTN 105

Query: 118 IHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHY 177
           I L +WDQ R G+EDEH PLD ++HPCKAEAT+WL+EEVHR TS HHH+AK+W SDPMH+
Sbjct: 106 ILLNKWDQQRIGSEDEHTPLDIMIHPCKAEATIWLVEEVHRRTSSHHHVAKIWQSDPMHH 165

Query: 178 VFIDSI 183
            F+DSI
Sbjct: 166 AFVDSI 171




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|255552097|ref|XP_002517093.1| conserved hypothetical protein [Ricinus communis] gi|223543728|gb|EEF45256.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|449432680|ref|XP_004134127.1| PREDICTED: protein RDM1-like isoform 1 [Cucumis sativus] gi|449432682|ref|XP_004134128.1| PREDICTED: protein RDM1-like isoform 2 [Cucumis sativus] gi|449504162|ref|XP_004162269.1| PREDICTED: protein RDM1-like isoform 1 [Cucumis sativus] gi|449504166|ref|XP_004162270.1| PREDICTED: protein RDM1-like isoform 2 [Cucumis sativus] Back     alignment and taxonomy information
>gi|297737087|emb|CBI26288.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|359477543|ref|XP_002279112.2| PREDICTED: protein RDM1 [Vitis vinifera] Back     alignment and taxonomy information
>gi|351722067|ref|NP_001237231.1| uncharacterized protein LOC100500129 [Glycine max] gi|255629410|gb|ACU15051.1| unknown [Glycine max] Back     alignment and taxonomy information
>gi|297830986|ref|XP_002883375.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297329215|gb|EFH59634.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|357480931|ref|XP_003610751.1| hypothetical protein MTR_5g006600 [Medicago truncatula] gi|355512086|gb|AES93709.1| hypothetical protein MTR_5g006600 [Medicago truncatula] Back     alignment and taxonomy information
>gi|357480933|ref|XP_003610752.1| hypothetical protein MTR_5g006610 [Medicago truncatula] gi|355512087|gb|AES93710.1| hypothetical protein MTR_5g006610 [Medicago truncatula] gi|388497794|gb|AFK36963.1| unknown [Medicago truncatula] Back     alignment and taxonomy information
>gi|30686741|ref|NP_188907.2| RNA-directed DNA methylation 1 protein [Arabidopsis thaliana] gi|73921107|sp|Q9LUJ3.1|RDM1_ARATH RecName: Full=Protein RDM1; AltName: Full=Protein RNA-directed DNA methylation 1 gi|9279686|dbj|BAB01243.1| unnamed protein product [Arabidopsis thaliana] gi|332643143|gb|AEE76664.1| RNA-directed DNA methylation 1 protein [Arabidopsis thaliana] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query202
TAIR|locus:2094399163 RDM1 "AT3G22680" [Arabidopsis 0.594 0.736 0.683 2.4e-44
TAIR|locus:2094399 RDM1 "AT3G22680" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 467 (169.5 bits), Expect = 2.4e-44, P = 2.4e-44
 Identities = 82/120 (68%), Positives = 101/120 (84%)

Query:    64 EKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQW 123
             ++GSL RRAEMYQ+YMKQ+PIPT R S+IPFTSWVGL  S+KQLY QPLHYLTN+ L++W
Sbjct:    40 DEGSLLRRAEMYQDYMKQVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQRW 99

Query:   124 DQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSI 183
             DQ RFGT+ E + LDS++HP KAEAT+WL+EE+HRLT  H H+A LW SDPM++ FID I
Sbjct:   100 DQSRFGTDSEEQRLDSIIHPTKAEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFIDPI 159


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.133   0.419    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      202       165   0.00093  107 3  11 22  0.38    32
                                                     30  0.45    34


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  611 (65 KB)
  Total size of DFA:  177 KB (2102 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  16.80u 0.18s 16.98t   Elapsed:  00:00:00
  Total cpu time:  16.80u 0.18s 16.98t   Elapsed:  00:00:00
  Start:  Mon May 20 20:21:47 2013   End:  Mon May 20 20:21:47 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM;IDA
GO:0006306 "DNA methylation" evidence=IGI;RCA
GO:0005515 "protein binding" evidence=IPI
GO:0043621 "protein self-association" evidence=IDA
GO:0044030 "regulation of DNA methylation" evidence=IMP
GO:0070918 "production of small RNA involved in gene silencing by RNA" evidence=IMP
GO:0031047 "gene silencing by RNA" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

ID ?Name ?Annotated EC number ?Identity ?Query coverage ?Hit coverage ?RBH(Q2H) ?RBH(H2Q) ?
Q9LUJ3RDM1_ARATHNo assigned EC number0.67210.60390.7484yesno

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.VIII.552.1
hypothetical protein (121 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query202
pfam09187119 pfam09187, DUF1950, Domain of unknown function(DUF 4e-63
>gnl|CDD|192224 pfam09187, DUF1950, Domain of unknown function(DUF1950) Back     alignment and domain information
 Score =  190 bits (485), Expect = 4e-63
 Identities = 77/116 (66%), Positives = 98/116 (84%)

Query: 68  LFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLR 127
           L R AEMYQ+YMKQLPIP++R S IP T+W GLGNS+KQLY QPLHYLTN+ L++WDQ R
Sbjct: 1   LLRPAEMYQQYMKQLPIPSKRGSHIPNTTWRGLGNSMKQLYGQPLHYLTNVLLQRWDQSR 60

Query: 128 FGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSI 183
            G+++EHKPLDS++HP KAEAT+W++EE HRLT+   H+A LW+SDPM++ FID I
Sbjct: 61  IGSDNEHKPLDSIIHPSKAEATIWVVEEFHRLTTSTQHVATLWISDPMYHAFIDPI 116


Members of this family pertain to a set of functionally uncharacterized hypothetical eukaryotic proteins. Length = 119

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 202
PF09187119 DUF1950: Domain of unknown function(DUF1950); Inte 100.0
>PF09187 DUF1950: Domain of unknown function(DUF1950); InterPro: IPR015270 Members of this family are a set of functionally uncharacterised hypothetical eukaryotic proteins [] Back     alignment and domain information
Probab=100.00  E-value=2.5e-88  Score=533.81  Aligned_cols=118  Identities=76%  Similarity=1.303  Sum_probs=93.5

Q ss_pred             HHHHHHHHHHhcccCCCCCCCCccccccchhhhhHHHHHHhcCchHHHHHHHHhhhhhhccCCCCCCCCCccccCccchh
Q 028897           68 LFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAE  147 (202)
Q Consensus        68 lirrAemYQ~yMkqiPIP~~r~s~Ipf~sW~gLg~SiKqlYgQPLHYLTnilLkqWDq~RiGs~dE~kpLd~IIhP~kAE  147 (202)
                      |||||||||+||||||||++|||+|||+||+|||||||||||||||||||+|||||||+||||+||+||||+||||||||
T Consensus         1 liRrAemYQ~yMk~iPIP~~rgs~Ip~~sW~gL~~SiKqlYgQPLHYLTn~llkqWDq~RiGs~dE~kpLd~IIhp~kAe   80 (119)
T PF09187_consen    1 LIRRAEMYQDYMKQIPIPSSRGSVIPFTSWMGLGRSIKQLYGQPLHYLTNILLKQWDQSRIGSEDEHKPLDAIIHPCKAE   80 (119)
T ss_dssp             THHHHHHHHHHHHTS---SSSSS----SSHHHHHHHHHHHHT----HHHHHHHHHHHHTTTT-SS----GGGTS-HHHHH
T ss_pred             CcchHHHHHHHHHcCCCCCCCCCccccchHHHHHHHHHHHhCCcHHHHHHHHHHHHHHhccCCccccCcHHHhcChhhHH
Confidence            68999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhHHHHHHHHhhccchHHHHHHHhhCCcccccccccCC
Q 028897          148 ATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITL  185 (202)
Q Consensus       148 atIW~vEevHR~tsS~~hlA~LW~~DPmy~~fiD~if~  185 (202)
                      ||||+||||||+||||+|||+||++|||||+|||+|||
T Consensus        81 atIW~vEevHR~tsS~~hlA~LW~~DPmyh~fvD~ifp  118 (119)
T PF09187_consen   81 ATIWLVEEVHRQTSSHQHLAKLWLSDPMYHAFVDSIFP  118 (119)
T ss_dssp             HHHHHHHHHHHHS--HHHHHHHHHT-TTGGGGS--S--
T ss_pred             HHHHHHHHHHHhcCChHHHHHHHhcCchhhhhccccCC
Confidence            99999999999999999999999999999999999998



; PDB: 2Q3T_A 1VK5_A 3GAN_A.


Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query202
1vk5_A157 X-Ray Structure Of Gene Product From Arabidopsis Th 4e-47
3gan_A157 Crystal Structure Of Gene Product From Arabidopsis 4e-44
>pdb|1VK5|A Chain A, X-Ray Structure Of Gene Product From Arabidopsis Thaliana At3g22680 Length = 157 Back     alignment and structure

Iteration: 1

Score = 184 bits (466), Expect = 4e-47, Method: Compositional matrix adjust. Identities = 82/122 (67%), Positives = 102/122 (83%) Query: 62 LNEKGSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLK 121 + ++GSL RRAEMYQ+YMKQ+PIPT R S+IPFTSWVGL S+KQLY QPLHYLTN+ L+ Sbjct: 32 VADEGSLLRRAEMYQDYMKQVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQ 91 Query: 122 QWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFID 181 +WDQ RFGT+ E + LDS++HP KAEAT+WL+EE+HRLT H H+A LW SDPM++ FID Sbjct: 92 RWDQSRFGTDSEEQRLDSIIHPTKAEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFID 151 Query: 182 SI 183 I Sbjct: 152 PI 153
>pdb|3GAN|A Chain A, Crystal Structure Of Gene Product From Arabidopsis Thaliana At3g22680 With Bound Suramin Length = 157 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query202
1vk5_A157 Expressed protein; structural genomics, unknown fu 6e-66
>1vk5_A Expressed protein; structural genomics, unknown function, protein structur initiative, center for eukaryotic structural genomics, CESG; HET: CPS; 1.60A {Arabidopsis thaliana} SCOP: a.220.1.1 PDB: 2q3t_A* 3gan_A* Length = 157 Back     alignment and structure
 Score =  198 bits (505), Expect = 6e-66
 Identities = 87/163 (53%), Positives = 110/163 (67%), Gaps = 15/163 (9%)

Query: 21  SDGDADNGLDGRQSSNDMTIDQPTKELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMK 80
           S     + +D   S     +D   +++  E               GSL RRAEMYQ+YMK
Sbjct: 6   SGDSGSSDVDAEISDGFSPLDTSHRDVADE---------------GSLLRRAEMYQDYMK 50

Query: 81  QLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSL 140
           Q+PIPT R S+IPFTSWVGL  S+KQLY QPLHYLTN+ L++WDQ RFGT+ E + LDS+
Sbjct: 51  QVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQRWDQSRFGTDSEEQRLDSI 110

Query: 141 VHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSI 183
           +HP KAEAT+WL+EE+HRLT  H H+A LW SDPM++ FID I
Sbjct: 111 IHPTKAEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFIDPI 153


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query202
1vk5_A157 Expressed protein; structural genomics, unknown fu 100.0
>1vk5_A Expressed protein; structural genomics, unknown function, protein structur initiative, center for eukaryotic structural genomics, CESG; HET: CPS; 1.60A {Arabidopsis thaliana} SCOP: a.220.1.1 PDB: 2q3t_A* 3gan_A* Back     alignment and structure
Probab=100.00  E-value=3.6e-96  Score=596.98  Aligned_cols=152  Identities=59%  Similarity=1.020  Sum_probs=120.4

Q ss_pred             ccccCCCCCC---CCCCCCCCCCCCCCCccccCCCCccchHHHHHHHHhhhcCCcccchHHHHHHHHHHhcccCCCCCCC
Q 028897           12 DVISSDDSAS---DGDADNGLDGRQSSNDMTIDQPTKELTSEVENIQKQKCQGLNEKGSLFRRAEMYQEYMKQLPIPTQR   88 (202)
Q Consensus        12 DviSSd~SsS---d~Ea~nGk~~~~~~n~~~l~~s~k~~~~~~~~~~k~k~~g~~es~~lirrAemYQ~yMkqiPIP~~r   88 (202)
                      ||||||+|+|   |+|++||        +||||||.|++++|               ++|||||||||+||||||||++|
T Consensus         2 dv~sS~~s~Ssd~d~e~~~g--------~~~i~~s~k~~~~e---------------~~liRrAEmYQ~YMK~iPIP~~~   58 (157)
T 1vk5_A            2 ELRPSGDSGSSDVDAEISDG--------FSPLDTSHRDVADE---------------GSLLRRAEMYQDYMKQVPIPTNR   58 (157)
T ss_dssp             ---------------------------------------------------------CTTHHHHHHHHHHHHTSCCCSSC
T ss_pred             CcccCCCCccCCccccccCC--------cccCCCcccccCcc---------------hhhhhHHHHHHHHHHhCCCCccc
Confidence            7889999887   8899988        59999999999999               99999999999999999999999


Q ss_pred             CccccccchhhhhHHHHHHhcCchHHHHHHHHhhhhhhccCCCCCCCCCccccCccchhhhHHHHHHHHhhccchHHHHH
Q 028897           89 SSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAK  168 (202)
Q Consensus        89 ~s~Ipf~sW~gLg~SiKqlYgQPLHYLTnilLkqWDq~RiGs~dE~kpLd~IIhP~kAEatIW~vEevHR~tsS~~hlA~  168 (202)
                      |++|||+||+|||||||||||||||||||+|||||||+||||+||+||||+||||||||||||+||||||+||||+|||+
T Consensus        59 ~s~Ipf~sW~gL~~SiKQlY~QPLHYLTN~LLkqWDQ~RiGs~dE~kpLd~IIhP~kAeatIW~vEE~HR~tsS~~hla~  138 (157)
T 1vk5_A           59 GSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQRWDQSRFGTDSEEQRLDSIIHPTKAEATIWLVEEIHRLTPSHLHMAL  138 (157)
T ss_dssp             CSCCCCSSHHHHHHHHHHHHCCCCCHHHHHHHHHHHHTTTTCSSCCCCGGGTSCHHHHHHHHHHHHHHHHHSCCHHHHHH
T ss_pred             CCcccchhHHHHHHHHHHHhcCcHHHHHHHHHHHHHHhhcCCccccCchhhhccchhhhhHHHHHHHHHhhcCChHHHHH
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHhhCCcccccccccCCC
Q 028897          169 LWLSDPMHYVFIDSITLN  186 (202)
Q Consensus       169 LW~~DPmy~~fiD~if~~  186 (202)
                      ||++|||||+|||+|||+
T Consensus       139 LW~sDPmyhafiD~ifp~  156 (157)
T 1vk5_A          139 LWRSDPMYHSFIDPIFPE  156 (157)
T ss_dssp             HHHTCTTGGGGSCCCCCC
T ss_pred             HHhcCchhhhhcccCCCC
Confidence            999999999999999985




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 202
d1vk5a_121 a.220.1.1 (A:) Hypothetical protein At3g22680 {Tha 3e-71
>d1vk5a_ a.220.1.1 (A:) Hypothetical protein At3g22680 {Thale-cress (Arabidopsis thaliana) [TaxId: 3702]} Length = 121 Back     information, alignment and structure

class: All alpha proteins
fold: Hypothetical protein At3g22680
superfamily: Hypothetical protein At3g22680
family: Hypothetical protein At3g22680
domain: Hypothetical protein At3g22680
species: Thale-cress (Arabidopsis thaliana) [TaxId: 3702]
 Score =  209 bits (534), Expect = 3e-71
 Identities = 82/118 (69%), Positives = 99/118 (83%)

Query: 66  GSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQ 125
           GSL RRAEMYQ+YMKQ+PIPT R S+IPFTSWVGL  S+KQLY QPLHYLTN+ L++WDQ
Sbjct: 1   GSLLRRAEMYQDYMKQVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQRWDQ 60

Query: 126 LRFGTEDEHKPLDSLVHPCKAEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSI 183
            RFGT+ E + LDS++HP KAEAT+WL+EE+HRLT  H H+A LW SDPM++ FID I
Sbjct: 61  SRFGTDSEEQRLDSIIHPTKAEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFIDPI 118


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query202
d1vk5a_121 Hypothetical protein At3g22680 {Thale-cress (Arabi 100.0
>d1vk5a_ a.220.1.1 (A:) Hypothetical protein At3g22680 {Thale-cress (Arabidopsis thaliana) [TaxId: 3702]} Back     information, alignment and structure
class: All alpha proteins
fold: Hypothetical protein At3g22680
superfamily: Hypothetical protein At3g22680
family: Hypothetical protein At3g22680
domain: Hypothetical protein At3g22680
species: Thale-cress (Arabidopsis thaliana) [TaxId: 3702]
Probab=100.00  E-value=1.6e-87  Score=527.23  Aligned_cols=121  Identities=68%  Similarity=1.158  Sum_probs=119.4

Q ss_pred             chHHHHHHHHHHhcccCCCCCCCCccccccchhhhhHHHHHHhcCchHHHHHHHHhhhhhhccCCCCCCCCCccccCccc
Q 028897           66 GSLFRRAEMYQEYMKQLPIPTQRSSIIPFTSWVGLGNSIKQLYEQPLHYLTNIHLKQWDQLRFGTEDEHKPLDSLVHPCK  145 (202)
Q Consensus        66 ~~lirrAemYQ~yMkqiPIP~~r~s~Ipf~sW~gLg~SiKqlYgQPLHYLTnilLkqWDq~RiGs~dE~kpLd~IIhP~k  145 (202)
                      ++|||||||||+||||||||++|||+|||+||+|||+|||||||||||||||+|||||||+||||+||+||||+||||||
T Consensus         1 ~~l~R~AemYQ~yMk~iPIP~~~~s~Ipf~tW~gL~~SiKQlY~QPLHYLTn~lLk~WDq~Rigs~dE~k~ld~iIhP~k   80 (121)
T d1vk5a_           1 GSLLRRAEMYQDYMKQVPIPTNRGSLIPFTSWVGLSISMKQLYGQPLHYLTNVLLQRWDQSRFGTDSEEQRLDSIIHPTK   80 (121)
T ss_dssp             CTTHHHHHHHHHHHHTSCCCSSCCSCCCCSSHHHHHHHHHHHHCCCCCHHHHHHHHHHHHTTTTCSSCCCCGGGTSCHHH
T ss_pred             ChHHHHHHHHHHHHHhCCCCcccCCcccchhHHHHHHHHHHHhcCcHHHHHHHHHHHHHHhccCCccccCchhhhccchh
Confidence            57999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhhhHHHHHHHHhhccchHHHHHHHhhCCcccccccccCCC
Q 028897          146 AEATVWLIEEVHRLTSCHHHLAKLWLSDPMHYVFIDSITLN  186 (202)
Q Consensus       146 AEatIW~vEevHR~tsS~~hlA~LW~~DPmy~~fiD~if~~  186 (202)
                      ||||||+||||||+||||+|||+||++|||||+|||+|||.
T Consensus        81 AeatIW~vEe~HR~tsS~~h~A~LW~sDPm~hafiD~ifp~  121 (121)
T d1vk5a_          81 AEATIWLVEEIHRLTPSHLHMALLWRSDPMYHSFIDPIFPE  121 (121)
T ss_dssp             HHHHHHHHHHHHHHSCCHHHHHHHHHTCTTGGGGSCCCCCC
T ss_pred             hhHHHHHHHHHHhhcCChHHHHHHHhcCchHHhhccccCCC
Confidence            99999999999999999999999999999999999999983