Citrus Sinensis ID: 013386


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440----
MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
cccccccccccccccccccccccccccccccccccccccccccccccccccEEEEEcccccccccccccccccccccHHHHcccccEEEEEEcccccccccccccEEEEEEccccccccccEEEcccEEEEEEEcccccccccccccccccccccccccccEEEEEEEcccccccccEEEEEEEEEccccccccccccHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHcccccEEEEEcccccccccccccccccHHHHcccccEEEEEEEEEEEEccccccccccEEEccccHHccccccccccHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccEEcccccccccccHHHHHHHHHHHHHHHccccHHHHHHHcccccccccHHHHHHHHHHHHHHcccccEEEEEEEccc
cccccccccEEccccEEEccccccccccccccccccccccccccccccccccEEEEccccccccccccccccccHHHHHHHHccHHcEEEEEEccEEEcccccccEEEEEEEcccccccccHEcccEEEEEEEEcccccccccccccccccEEEEcccccEEEEEEEEcccccccccEEEEEEEEEEcccccccHHccHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHcHHHHHHHccccccccccHHHHHccEEEEEEEEEEEEEEccccccccEEEEEccccHHHccccccccHHHHHHHHHHHHHHHHHcccccEcccccccEEEEccccccccccccHHHcccHHHHHEEcccccccccccHHHHHHHHHHHHHHcccccccEEEEEEcccccHHHHHHHHHHHHHHHHHcccccEEEEEEcccc
mdnsgnpqdsvvppvegvagggtaygwndncsqssgplngsinpteiptadlvhvwcmpstanvgpqemprplepiNLLAARNERESVQIAlrpkvswsssstagVVQVQCSDlcsasgdrlVVGQSLMLRRVvpmlgvpdalvpldlpvcqislipgettAVWVsidapyaqppglyegeiiitskadtelssqclgkgekHRLFMELRNCldnvepiegkplhEVVERAKSTATTLRRVIFsplfseffsdngpidmmdedaiSNLSVRVKLSLTVWdfilpatpslpavigisdtviedrfgvrhgsdEWYEALDQHFKWLLQYRIspffcrwgesmrvltytcpwpadhpksdeyfsdprlaayavpyspvlssndgakdYVRKEIELLRTKAHWKKAYFylwdeplnmehysSVRNMASElhayapdarvLTTYYCGIV
mdnsgnpqdsvvppVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTanvgpqemprPLEPINLLAARNERESVQIALrpkvswsssstAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNvepiegkplhevVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVpyspvlssndgaKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
MDNSGNPQDSvvppvegvagggTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
*******************GGGTAYGWND***************TEIPTADLVHVWCMPST****************************IALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGI*
*************PVEGVAGGGTAYGWNDNCSQS**********TEIPTADLVHVWCMPS*****************LLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADT*********GEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
**********VVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRP*************QVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
*******QDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGIV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query444
356518975 649 PREDICTED: uncharacterized protein LOC10 0.993 0.679 0.774 0.0
356509690 644 PREDICTED: uncharacterized protein LOC10 0.981 0.677 0.756 0.0
449460114 649 PREDICTED: uncharacterized protein LOC10 0.993 0.679 0.762 0.0
224132110 652 predicted protein [Populus trichocarpa] 0.988 0.673 0.770 0.0
255538584 651 conserved hypothetical protein [Ricinus 0.988 0.674 0.768 0.0
225458333 638 PREDICTED: uncharacterized protein LOC10 0.966 0.672 0.752 0.0
297852256 643 hypothetical protein ARALYDRAFT_473837 [ 0.979 0.676 0.719 0.0
42562571 643 uncharacterized protein [Arabidopsis tha 0.979 0.676 0.703 0.0
7767671 687 F27F5.22 [Arabidopsis thaliana] 0.954 0.617 0.683 0.0
222637345 709 hypothetical protein OsJ_24889 [Oryza sa 0.979 0.613 0.653 1e-172
>gi|356518975|ref|XP_003528150.1| PREDICTED: uncharacterized protein LOC100782659 [Glycine max] Back     alignment and taxonomy information
 Score =  727 bits (1876), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/443 (77%), Positives = 389/443 (87%), Gaps = 2/443 (0%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           MDNSGNPQD VVPPVEGVAGGGTAYGWND  +     + G I+PT IPT DLVHVWCMPS
Sbjct: 1   MDNSGNPQDVVVPPVEGVAGGGTAYGWNDGGTHGLN-VKGPIDPTGIPTRDLVHVWCMPS 59

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSWS SS AG VQ+QCSDLCS SGD
Sbjct: 60  TANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWSGSSVAGTVQIQCSDLCSTSGD 119

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG YEG
Sbjct: 120 RLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQYEG 179

Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
           EI IT+ KAD E   Q L K EKH+L+ +L+ CLD VEPI+GKPL EVVER KS  T+LR
Sbjct: 180 EIAITAIKADAESPVQILSKVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSATTSLR 239

Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
           R++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GISDTV
Sbjct: 240 RILLSPSFSEFFSDNGPVDVMDEDAISSLSIRMKLNLTVWEFVLPETPSLPAVFGISDTV 299

Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
           IEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKSDEY
Sbjct: 300 IEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKSDEY 359

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
           FSDPRLAAYAVPY  V+S ND AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SV
Sbjct: 360 FSDPRLAAYAVPYKQVVSGNDAAKDYLQKQVEILRTKTHWRKAYFYLWDEPLNLEQYDSV 419

Query: 420 RNMASELHAYAPDARVLTTYYCG 442
           RNMASE+HAYAPDAR+LTTYYCG
Sbjct: 420 RNMASEIHAYAPDARILTTYYCG 442




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|356509690|ref|XP_003523579.1| PREDICTED: uncharacterized protein LOC100799554 [Glycine max] Back     alignment and taxonomy information
>gi|449460114|ref|XP_004147791.1| PREDICTED: uncharacterized protein LOC101205217 [Cucumis sativus] gi|449476778|ref|XP_004154831.1| PREDICTED: uncharacterized LOC101205217 [Cucumis sativus] Back     alignment and taxonomy information
>gi|224132110|ref|XP_002321258.1| predicted protein [Populus trichocarpa] gi|222862031|gb|EEE99573.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255538584|ref|XP_002510357.1| conserved hypothetical protein [Ricinus communis] gi|223551058|gb|EEF52544.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225458333|ref|XP_002283035.1| PREDICTED: uncharacterized protein LOC100243809 [Vitis vinifera] gi|302142468|emb|CBI19671.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|297852256|ref|XP_002894009.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp. lyrata] gi|297339851|gb|EFH70268.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|42562571|ref|NP_175129.3| uncharacterized protein [Arabidopsis thaliana] gi|30725314|gb|AAP37679.1| At1g45150 [Arabidopsis thaliana] gi|110742869|dbj|BAE99332.1| hypothetical protein [Arabidopsis thaliana] gi|332193963|gb|AEE32084.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|7767671|gb|AAF69168.1|AC007915_20 F27F5.22 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|222637345|gb|EEE67477.1| hypothetical protein OsJ_24889 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query444
TAIR|locus:2028275 643 AT1G45150 "AT1G45150" [Arabido 0.932 0.643 0.714 6.6e-168
TAIR|locus:2028275 AT1G45150 "AT1G45150" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1633 (579.9 bits), Expect = 6.6e-168, P = 6.6e-168
 Identities = 301/421 (71%), Positives = 355/421 (84%)

Query:    23 TAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAAR 82
             TAYG+ND     + PL  S +P+E+PTADLV+VWCMP+T NVG QE PR LEPINLLAAR
Sbjct:    21 TAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPNTVNVGSQETPRALEPINLLAAR 75

Query:    83 NERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDA 142
             NERES QIA+RPKVSW++SS +G+VQVQCSDLCS++GDRLVVGQSL LRRVVP+LGVPDA
Sbjct:    76 NERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSSAGDRLVVGQSLKLRRVVPVLGVPDA 135

Query:   143 LVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS-KADTELSSQCLGKGE 201
             LVPLDLPV Q+SL PGET+ +WVSID P  QPPG YEGEIII++ K D   SS  L K E
Sbjct:   136 LVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQYEGEIIISAMKTDGGGSSH-LAKHE 194

Query:   202 KHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMD 261
             K +L +EL  CLD +EPIEGKP+ EVVER K  +++LRR++FSP FSEF S NG  DMM+
Sbjct:   195 KDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSSLRRILFSPSFSEFISTNGSTDMME 254

Query:   262 EDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHF 321
             ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SDTVIEDRF V HGS++WY+ LD HF
Sbjct:   255 EDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSDTVIEDRFAVEHGSEDWYKKLDLHF 314

Query:   322 KWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDG 381
             KWLLQYRISP+FC+WGESMRVLTYT PWPADHPKSDEY SD RLAAYAVPY  V++ +D 
Sbjct:   315 KWLLQYRISPYFCKWGESMRVLTYTSPWPADHPKSDEYLSDSRLAAYAVPYRQVIAGDDS 374

Query:   382 AKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYC 441
              + Y+RKE+E+LR+K HW KAYFYLWDEPLNMEH+ +VR MASE++AYAPD+RVLTTYYC
Sbjct:   375 RESYLRKEVEILRSKPHWNKAYFYLWDEPLNMEHFDNVRKMASEIYAYAPDSRVLTTYYC 434

Query:   442 G 442
             G
Sbjct:   435 G 435


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.318   0.134   0.420    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      444       432   0.00086  118 3  11 22  0.41    34
                                                     34  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  623 (66 KB)
  Total size of DFA:  307 KB (2157 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  34.13u 0.10s 34.23t   Elapsed:  00:00:02
  Total cpu time:  34.13u 0.10s 34.23t   Elapsed:  00:00:02
  Start:  Fri May 10 15:39:11 2013   End:  Fri May 10 15:39:13 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0045892 "negative regulation of transcription, DNA-dependent" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 444
PF1063378 NPCBM_assoc: NPCBM-associated, NEW3 domain of alph 93.45
PF00150 281 Cellulase: Cellulase (glycosyl hydrolase family 5) 90.14
PF15418132 DUF4625: Domain of unknown function (DUF4625) 89.85
PF01229 486 Glyco_hydro_39: Glycosyl hydrolases family 39; Int 89.78
COG1470513 Predicted membrane protein [Function unknown] 86.51
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known Back     alignment and domain information
Probab=93.45  E-value=0.14  Score=41.29  Aligned_cols=32  Identities=31%  Similarity=0.543  Sum_probs=25.7

Q ss_pred             eeCCCCeeEEEEEEEcCCCCCCceeEEEEEEE
Q 013386          154 SLIPGETTAVWVSIDAPYAQPPGLYEGEIIIT  185 (444)
Q Consensus       154 ~l~ag~~q~lWI~V~VP~~a~pG~Y~GtVtVt  185 (444)
                      .|++|+.+.+=++|.+|++++||.|..+++++
T Consensus        45 ~l~pG~s~~~~~~V~vp~~a~~G~y~v~~~a~   76 (78)
T PF10633_consen   45 SLPPGESVTVTFTVTVPADAAPGTYTVTVTAR   76 (78)
T ss_dssp             -B-TTSEEEEEEEEEE-TT--SEEEEEEEEEE
T ss_pred             cCCCCCEEEEEEEEECCCCCCCceEEEEEEEE
Confidence            68899999999999999999999999999886



It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.

>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3 Back     alignment and domain information
>PF15418 DUF4625: Domain of unknown function (DUF4625) Back     alignment and domain information
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3 Back     alignment and domain information
>COG1470 Predicted membrane protein [Function unknown] Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query444
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-07
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 1e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 51.8 bits (123), Expect = 4e-07
 Identities = 60/419 (14%), Positives = 114/419 (27%), Gaps = 122/419 (29%)

Query: 79  LAA---RNERESVQIALRPKVSW---SSSSTAGVVQVQCSDLCSASGDRLVVGQSLM--- 129
           +A     + +  VQ  +  K+ W    + ++   V      L                  
Sbjct: 166 VALDVCLSYK--VQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNI 223

Query: 130 ----------LRRVVPMLGVPDALVPLDLPV------------CQISLIPGETT---AVW 164
                     LRR++      + L+ L   V            C+I L+   TT    V 
Sbjct: 224 KLRIHSIQAELRRLLKSKPYENCLLVLL-NVQNAKAWNAFNLSCKI-LL---TTRFKQVT 278

Query: 165 VSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGK--------------------GEKHR 204
             + A       L    + +T      L  + L                       E  R
Sbjct: 279 DFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIR 338

Query: 205 LFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEF--FSDNGPI----- 257
              +     DN + +    L  ++E   S+   L    +  +F     F  +  I     
Sbjct: 339 ---DGLATWDNWKHVNCDKLTTIIE---SSLNVLEPAEYRKMFDRLSVFPPSAHIPTILL 392

Query: 258 -----DMMDEDAISNLSVRVKLSL----------TVWDFILPATPSLPAVIGISDTVIE- 301
                D++  D +  ++   K SL          ++    L     L     +  ++++ 
Sbjct: 393 SLIWFDVIKSDVMVVVNKLHKYSLVEKQPKESTISIPSIYLELKVKLENEYALHRSIVDH 452

Query: 302 ---------DRFGVRHGSDEWYEALDQHFKWLLQYRISPFF------CRWGESMRVLTYT 346
                    D     +    +Y  +  H K +        F       R+ E  ++   +
Sbjct: 453 YNIPKTFDSDDLIPPYLDQYFYSHIGHHLKNIEHPERMTLFRMVFLDFRFLEQ-KIRHDS 511

Query: 347 CPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGA--------KDYVRK-EIELLRTK 396
             W A     +      +       Y P +  ND           D++ K E  L+ +K
Sbjct: 512 TAWNASGSILNT-LQQLKF------YKPYICDNDPKYERLVNAILDFLPKIEENLICSK 563


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query444
4ekj_A 500 Beta-xylosidase; TIM-barrel fold, hemicellulase, h 93.83
3qr3_A 340 Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A 88.09
1ece_A 358 Endocellulase E1; glycosyl hydrolase; HET: BGC; 2. 84.31
1uhv_A 500 Beta-xylosidase; family 39 glycoside hydrolase, xy 83.02
1tvn_A 293 Cellulase, endoglucanase G; glycoside hydrolase, C 81.24
3ndz_A 345 Endoglucanase D; cellotriose, xylanase, carbohydra 80.77
3aof_A 317 Endoglucanase; glycosyl hydrolase family 5, cellul 80.38
1egz_A 291 Endoglucanase Z, EGZ, CEL5; glycosyl hydrolase, CL 80.04
>4ekj_A Beta-xylosidase; TIM-barrel fold, hemicellulase, hydrolase; 2.50A {Caulobacter vibrioides} Back     alignment and structure
Probab=93.83  E-value=0.11  Score=52.83  Aligned_cols=111  Identities=22%  Similarity=0.301  Sum_probs=69.6

Q ss_pred             HHHHHHHHHHHHhCcccCCCcCCCCceEEEeecCCCCCCCCcccccccccccceeecccCCCCCChHHHHHHHHHHHHHH
Q 013386          314 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELL  393 (444)
Q Consensus       314 f~~L~~~~~~~~~~riS~~f~~Wg~~mrv~~~~~~w~~d~~~~~~y~~d~~~~~y~vp~~~~~~g~~~~~~~L~~~~~~L  393 (444)
                      |..||+.+++|++++|.|++.        +.+.-.|....+....|..+     +.-     ....+.+.++++++++|+
T Consensus        83 ~~~~D~~~d~~~~~G~~p~~~--------l~~~P~~~~~~~~~~~~~~~-----~~~-----~~~~~~w~~~~~~~~~~~  144 (500)
T 4ekj_A           83 WTKIDQLYDALLAKGIKPFIE--------LGFTPEAMKTSDQTIFYWKG-----NTS-----HPKLGPWRDLIDAFVHHL  144 (500)
T ss_dssp             CHHHHHHHHHHHHTTCEEEEE--------ECCBCGGGCSSCCEETTTTE-----ECS-----CCCHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHCCCEEEEE--------EeCCchhhcCCCCccccccC-----CCC-----cccHHHHHHHHHHHHHHH
Confidence            678999999999999988643        12222222222211111100     000     112346888899999999


Q ss_pred             Hhc-C--chhhhhhhhcCCCCC--------cccH-HHHHHHHHHHHhhCCCCcEEEEEeec
Q 013386          394 RTK-A--HWKKAYFYLWDEPLN--------MEHY-SSVRNMASELHAYAPDARVLTTYYCG  442 (444)
Q Consensus       394 r~k-G--w~~k~yfyl~DEP~~--------~e~~-~~~r~a~~~ir~~~Pd~ril~t~~~~  442 (444)
                      .++ |  ..+..||=+|.||..        +++| +.++++++.||+.+|+++|.....|+
T Consensus       145 ~~RYg~~~v~~w~~EvwNEp~~~~~~~~~~~~~y~~l~~~~~~aik~~~P~~~Vgg~~~~~  205 (500)
T 4ekj_A          145 RARYGVEEVRTWFFEVWNEPNLDGFWEKADQAAYFELYDVTARAIKAIDPSLRVGGPATAG  205 (500)
T ss_dssp             HHHHCHHHHHTSEEEESSCTTSTTTSGGGCHHHHHHHHHHHHHHHHHHCTTSEEEEEEEGG
T ss_pred             HHhhCccccceeEEEEEECCCCccCCCCCCHHHHHHHHHHHHHHHHhhCCccccccCccCC
Confidence            774 3  345667778999853        2233 46778889999999999997655544



>3qr3_A Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A {Hypocrea jecorina} Back     alignment and structure
>1ece_A Endocellulase E1; glycosyl hydrolase; HET: BGC; 2.40A {Acidothermus cellulolyticus} SCOP: c.1.8.3 PDB: 1vrx_A Back     alignment and structure
>1uhv_A Beta-xylosidase; family 39 glycoside hydrolase, xylan, xylose, covalent glycosyl-enzyme intermediate; 2.10A {Thermoanaerobacterium saccharolyticum} SCOP: b.71.1.2 c.1.8.3 PDB: 1px8_A Back     alignment and structure
>1tvn_A Cellulase, endoglucanase G; glycoside hydrolase, CLAN GH-A, family 5-2; 1.41A {Pseudoalteromonas haloplanktis} SCOP: c.1.8.3 PDB: 1tvp_A* Back     alignment and structure
>3ndz_A Endoglucanase D; cellotriose, xylanase, carbohydrate binding D glucanase, hydrolase; HET: CT3; 2.08A {Clostridium cellulovorans} PDB: 3ndy_A* Back     alignment and structure
>3aof_A Endoglucanase; glycosyl hydrolase family 5, cellulase, biofuel, hyperthermo hydrolase; HET: BMA; 1.29A {Thermotoga maritima} PDB: 3amg_A* 3amc_A 3amd_A 3mmu_A 3mmw_A 3azs_A* 3azr_A* 3azt_A* Back     alignment and structure
>1egz_A Endoglucanase Z, EGZ, CEL5; glycosyl hydrolase, CLAN GH-A, family 5-2, cellulase; 2.30A {Erwinia chrysanthemi} SCOP: c.1.8.3 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query444
d1uhva2 346 Beta-D-xylosidase, catalytic domain {Thermoanaerob 94.13
d1w8oa1103 Sialidase, "linker" domain {Micromonospora viridif 93.36
d1ceoa_ 340 Endoglucanase CelC {Clostridium thermocellum [TaxI 88.1
d1ecea_ 358 Endocellulase E1 {Acidothermus cellulolyticus [Tax 85.0
d1nq6a_ 302 Xylanase A, catalytic core {Streptomyces halstedii 80.28
d1fh9a_ 312 Xylanase A, catalytic core {Cellulomonas fimi [Tax 80.23
>d1uhva2 c.1.8.3 (A:14-359) Beta-D-xylosidase, catalytic domain {Thermoanaerobacterium saccharolyticum [TaxId: 28896]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: TIM beta/alpha-barrel
superfamily: (Trans)glycosidases
family: beta-glycanases
domain: Beta-D-xylosidase, catalytic domain
species: Thermoanaerobacterium saccharolyticum [TaxId: 28896]
Probab=94.13  E-value=0.065  Score=48.21  Aligned_cols=111  Identities=17%  Similarity=0.248  Sum_probs=69.0

Q ss_pred             HHHHHHHHHHHHhCcccCCCcCCCCceEEEeecCCCCCCCCcccccccccccceeecccCCCCCChHHHHHHHHHHHHHH
Q 013386          314 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELL  393 (444)
Q Consensus       314 f~~L~~~~~~~~~~riS~~f~~Wg~~mrv~~~~~~w~~d~~~~~~y~~d~~~~~y~vp~~~~~~g~~~~~~~L~~~~~~L  393 (444)
                      |..|++.+++|.++++.++++        +.+.-+|....+....+.     ..+  +  ..-...+.+.+|++.+++|+
T Consensus        66 ~~~~D~~~~~~~~~g~~~~~~--------l~~~p~~~~~~~~~~~~~-----~~~--~--~~p~~~~~w~~~v~~~~~~y  128 (346)
T d1uhva2          66 FTYIDRIFDSFLEIGIRPFVE--------IGFMPKKLASGTQTVFYW-----EGN--V--TPPKDYEKWSDLVKAVLHHF  128 (346)
T ss_dssp             CHHHHHHHHHHHHHTCEECEE--------ECCCCTTTBSSCCEETTT-----TEE--C--SCBSCHHHHHHHHHHHHHHH
T ss_pred             hHhHHHHHHHHHHcCCCeEEE--------EeccCccccCCCCCcccc-----ccc--C--CChhhHHHHHHHHHHHHHHH
Confidence            578899999999999888743        122222322111110000     000  0  00112347888999999999


Q ss_pred             HhcCch---hhhhhhhcCCCCCc--------ccH-HHHHHHHHHHHhhCCCCcEEEEEee
Q 013386          394 RTKAHW---KKAYFYLWDEPLNM--------EHY-SSVRNMASELHAYAPDARVLTTYYC  441 (444)
Q Consensus       394 r~kGw~---~k~yfyl~DEP~~~--------e~~-~~~r~a~~~ir~~~Pd~ril~t~~~  441 (444)
                      +.+...   ...|+=+|.||...        ++| +.++++.+.||+.+|+++|+....+
T Consensus       129 ~~~~~~~~~~~~~~evwNEp~~~~~~~~~~~~~y~~~~~~~~~aik~~~P~~~v~~~~~~  188 (346)
T d1uhva2         129 ISRYGIEEVLKWPFEIWNEPNLKEFWKDADEKEYFKLYKVTAKAIKEVNENLKVGGPAIC  188 (346)
T ss_dssp             HHHHCHHHHTTCCEEESSCTTSTTTSGGGCHHHHHHHHHHHHHHHHHHCTTSCEEEEEEC
T ss_pred             HhhcCcccccccccccccCcccccCCCCCCHHHHHHHHHHHHHHHhccCCCceEeecccc
Confidence            876432   35567789999641        222 4677888999999999999876544



>d1w8oa1 b.1.18.2 (A:403-505) Sialidase, "linker" domain {Micromonospora viridifaciens [TaxId: 1881]} Back     information, alignment and structure
>d1ceoa_ c.1.8.3 (A:) Endoglucanase CelC {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
>d1ecea_ c.1.8.3 (A:) Endocellulase E1 {Acidothermus cellulolyticus [TaxId: 28049]} Back     information, alignment and structure
>d1nq6a_ c.1.8.3 (A:) Xylanase A, catalytic core {Streptomyces halstedii [TaxId: 1944]} Back     information, alignment and structure
>d1fh9a_ c.1.8.3 (A:) Xylanase A, catalytic core {Cellulomonas fimi [TaxId: 1708]} Back     information, alignment and structure