Citrus Sinensis ID: 009770


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520------
MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
ccccccccccccccccccccccccccccccccccccccccccccccccccccEEEEcccccccccccccccccccccHHHHccccEEEEEEEccccccccccccccEEEEEccccccccccEEEcccEEEEEEEcccccccccccccccccccccccccccEEEEEEEcccccccccEEEEEEEEEcccccccccccccHHHHHHHHHHHcccccccccccccHHHHHHHHHHHccccEEEEEccccccccccccccEEEcHHHcccccEEEEEEEEEEEEcccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccEEcccccccccccHHHHHHHHHHHHHHHcccHHHHHHcccccccccccHHHHHHHHHHHHHHcccccEEEEEEEcccccccccccEEEEEEcccccccccEEEEEEEccccccHHHHHHHHHHccccccEEEEEccEEEEEEcccccccEEEccccccc
ccccccccccEcccEEEEccccccccccccccccccccccccccccccccccEEEEccccccccccccccccccccEEHHHccccccEEEEEEccEEEcccccccEEEEEEEccccccccccccccEEEEEEEEcccccccccccccccccEEEEcccccEEEEEEEEcccccccccEEEEEEEEEEcccccccHHccHHHHHHHHHHHHHHHHHccccccccHHHHHHHHHHHHHHHHHHHHcHHHHHHHccccccccccHHHHHccEEEEEEEEEEEEEEccccccccEEEEEccccHHHccccccccHHHHHHHHHHHHHHHHEcccccEcccccccEEEEccccccccccccHHHcccHHHHHEEcccccccccccHHHHHHHHHHHHHHcccccccEEEEEEcccccHHHHHHHHHHHHHHHHHcccccEEEEEEcccccccccccccEEEEEcccccccccEEEEccHHHcccHHHHHHHHHHHHccccccEEEEcHHHHHHHHHccccccEEEcccccc
mdnsgnpqdsvvppvegvagggtaygwndncsqssgplngsinpteiptadlvhvwcmpstanvgpqemprplepiNLLAARNERESVQIAlrpkvswsssstagVVQVQCSDlcsasgdrlVVGQSLMLRRVvpmlgvpdalvpldlpvcqislipgettAVWVsidapyaqppglyegeiiitskadtelssqclgkgekHRLFMELRNCldnvepiegkplhEVVERAKSTATTLRRVIFsplfseffsdngpidmmdedaiSNLSVRVKLSLTVWdfilpatpslpavigisdtviedrfgvrhgsdEWYEALDQHFKWLLQYRIspffcrwgesmrvltytcpwpadhpksdeyfsdprlaayavpyspvlssndgakdYVRKEIELLRTKAHWKKAYFylwdeplnmehysSVRNMASElhayapdarvlttyycgpsdaplgptpfesfvkvpkflrphtqiyctsewvlgnREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYyypimtaddfwn
mdnsgnpqdsvvppVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTanvgpqemprPLEPINLLAARNERESVQIALrpkvswsssstAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNvepiegkplhevVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVpyspvlssndgaKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
MDNSGNPQDSvvppvegvagggTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
*******************GGGTAYGWND***************TEIPTADLVHVWCMPST****************************IALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFW*
*************PVEGVAGGGTAYGWNDNCSQS**********TEIPTADLVHVWCMP*******************LAARNERESVQIALRPKVSWS*SSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTEL*******GEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
**********VVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRP*************QVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
********DSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiHHHHHHHHHHHHHHHHHHHHoooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhoooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGEVICVESFWLLLWKTCVYYYPIMTADDFWN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

No hits with e-value below 0.001 by BLAST

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query526
356518975 649 PREDICTED: uncharacterized protein LOC10 0.942 0.764 0.787 0.0
449460114 649 PREDICTED: uncharacterized protein LOC10 0.942 0.764 0.777 0.0
356509690 644 PREDICTED: uncharacterized protein LOC10 0.933 0.762 0.765 0.0
224132110 652 predicted protein [Populus trichocarpa] 0.939 0.757 0.785 0.0
255538584 651 conserved hypothetical protein [Ricinus 0.939 0.758 0.776 0.0
225458333 638 PREDICTED: uncharacterized protein LOC10 0.920 0.758 0.764 0.0
297852256 643 hypothetical protein ARALYDRAFT_473837 [ 0.931 0.762 0.738 0.0
42562571 643 uncharacterized protein [Arabidopsis tha 0.931 0.762 0.720 0.0
7767671 687 F27F5.22 [Arabidopsis thaliana] 0.910 0.697 0.660 0.0
222637345 709 hypothetical protein OsJ_24889 [Oryza sa 0.931 0.691 0.661 0.0
>gi|356518975|ref|XP_003528150.1| PREDICTED: uncharacterized protein LOC100782659 [Glycine max] Back     alignment and taxonomy information
 Score =  830 bits (2143), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 392/498 (78%), Positives = 441/498 (88%), Gaps = 2/498 (0%)

Query: 1   MDNSGNPQDSVVPPVEGVAGGGTAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPS 60
           MDNSGNPQD VVPPVEGVAGGGTAYGWND  +     + G I+PT IPT DLVHVWCMPS
Sbjct: 1   MDNSGNPQDVVVPPVEGVAGGGTAYGWNDGGTHGLN-VKGPIDPTGIPTRDLVHVWCMPS 59

Query: 61  TANVGPQEMPRPLEPINLLAARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGD 120
           TANVGPQ+MPR LEPINLLAARNERESVQIA+RPKVSWS SS AG VQ+QCSDLCS SGD
Sbjct: 60  TANVGPQDMPRHLEPINLLAARNERESVQIAIRPKVSWSGSSVAGTVQIQCSDLCSTSGD 119

Query: 121 RLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEG 180
           RL+VGQSL+LRRVVP+LGVPDALVP+DLPV QI+L PGETTA+W+SID P +QPPG YEG
Sbjct: 120 RLIVGQSLLLRRVVPILGVPDALVPVDLPVSQINLFPGETTALWISIDVPSSQPPGQYEG 179

Query: 181 EIIITS-KADTELSSQCLGKGEKHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLR 239
           EI IT+ KAD E   Q L K EKH+L+ +L+ CLD VEPI+GKPL EVVER KS  T+LR
Sbjct: 180 EIAITAIKADAESPVQILSKVEKHQLYRDLKGCLDIVEPIDGKPLDEVVERVKSATTSLR 239

Query: 240 RVIFSPLFSEFFSDNGPIDMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTV 299
           R++ SP FSEFFSDNGP+D+MDEDAIS+LS+R+KL+LTVW+F+LP TPSLPAV GISDTV
Sbjct: 240 RILLSPSFSEFFSDNGPVDVMDEDAISSLSIRMKLNLTVWEFVLPETPSLPAVFGISDTV 299

Query: 300 IEDRFGVRHGSDEWYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEY 359
           IEDRFGV+ G+ EWYEALDQHFKWLLQYRISP+FC+W + MRVLTYT PWPADHPKSDEY
Sbjct: 300 IEDRFGVQQGTAEWYEALDQHFKWLLQYRISPYFCKWADGMRVLTYTSPWPADHPKSDEY 359

Query: 360 FSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSV 419
           FSDPRLAAYAVPY  V+S ND AKDY++K++E+LRTK HW+KAYFYLWDEPLN+E Y SV
Sbjct: 360 FSDPRLAAYAVPYKQVVSGNDAAKDYLQKQVEILRTKTHWRKAYFYLWDEPLNLEQYDSV 419

Query: 420 RNMASELHAYAPDARVLTTYYCGPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGN 479
           RNMASE+HAYAPDAR+LTTYYCGP+DAPL PTPFE+FVKVP FLRPH QIYCTSEWVLGN
Sbjct: 420 RNMASEIHAYAPDARILTTYYCGPNDAPLAPTPFEAFVKVPSFLRPHNQIYCTSEWVLGN 479

Query: 480 REDLVKDIVTELQPENGE 497
           REDLVKDI+TELQPENGE
Sbjct: 480 REDLVKDIITELQPENGE 497




Source: Glycine max

Species: Glycine max

Genus: Glycine

Family: Fabaceae

Order: Fabales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449460114|ref|XP_004147791.1| PREDICTED: uncharacterized protein LOC101205217 [Cucumis sativus] gi|449476778|ref|XP_004154831.1| PREDICTED: uncharacterized LOC101205217 [Cucumis sativus] Back     alignment and taxonomy information
>gi|356509690|ref|XP_003523579.1| PREDICTED: uncharacterized protein LOC100799554 [Glycine max] Back     alignment and taxonomy information
>gi|224132110|ref|XP_002321258.1| predicted protein [Populus trichocarpa] gi|222862031|gb|EEE99573.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|255538584|ref|XP_002510357.1| conserved hypothetical protein [Ricinus communis] gi|223551058|gb|EEF52544.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
>gi|225458333|ref|XP_002283035.1| PREDICTED: uncharacterized protein LOC100243809 [Vitis vinifera] gi|302142468|emb|CBI19671.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|297852256|ref|XP_002894009.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp. lyrata] gi|297339851|gb|EFH70268.1| hypothetical protein ARALYDRAFT_473837 [Arabidopsis lyrata subsp. lyrata] Back     alignment and taxonomy information
>gi|42562571|ref|NP_175129.3| uncharacterized protein [Arabidopsis thaliana] gi|30725314|gb|AAP37679.1| At1g45150 [Arabidopsis thaliana] gi|110742869|dbj|BAE99332.1| hypothetical protein [Arabidopsis thaliana] gi|332193963|gb|AEE32084.1| uncharacterized protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|7767671|gb|AAF69168.1|AC007915_20 F27F5.22 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|222637345|gb|EEE67477.1| hypothetical protein OsJ_24889 [Oryza sativa Japonica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query526
TAIR|locus:2028275 643 AT1G45150 "AT1G45150" [Arabido 0.891 0.729 0.731 6.3e-195
TAIR|locus:2028275 AT1G45150 "AT1G45150" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1888 (669.7 bits), Expect = 6.3e-195, P = 6.3e-195
 Identities = 348/476 (73%), Positives = 404/476 (84%)

Query:    23 TAYGWNDNCSQSSGPLNGSINPTEIPTADLVHVWCMPSTANVGPQEMPRPLEPINLLAAR 82
             TAYG+ND     + PL  S +P+E+PTADLV+VWCMP+T NVG QE PR LEPINLLAAR
Sbjct:    21 TAYGFND-----AEPLKQSTDPSEVPTADLVNVWCMPNTVNVGSQETPRALEPINLLAAR 75

Query:    83 NERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDA 142
             NERES QIA+RPKVSW++SS +G+VQVQCSDLCS++GDRLVVGQSL LRRVVP+LGVPDA
Sbjct:    76 NERESFQIAMRPKVSWAASSPSGIVQVQCSDLCSSAGDRLVVGQSLKLRRVVPVLGVPDA 135

Query:   143 LVPLDLPVCQISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS-KADTELSSQCLGKGE 201
             LVPLDLPV Q+SL PGET+ +WVSID P  QPPG YEGEIII++ K D   SS  L K E
Sbjct:   136 LVPLDLPVSQLSLFPGETSVIWVSIDVPTGQPPGQYEGEIIISAMKTDGGGSSH-LAKHE 194

Query:   202 KHRLFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEFFSDNGPIDMMD 261
             K +L +EL  CLD +EPIEGKP+ EVVER K  +++LRR++FSP FSEF S NG  DMM+
Sbjct:   195 KDQLCVELNTCLDIMEPIEGKPMDEVVERIKCASSSLRRILFSPSFSEFISTNGSTDMME 254

Query:   262 EDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDEWYEALDQHF 321
             ED +SNLS+R+KL LTVW+FI+P TPSLPAVIG+SDTVIEDRF V HGS++WY+ LD HF
Sbjct:   255 EDVVSNLSLRIKLRLTVWEFIIPVTPSLPAVIGVSDTVIEDRFAVEHGSEDWYKKLDLHF 314

Query:   322 KWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDG 381
             KWLLQYRISP+FC+WGESMRVLTYT PWPADHPKSDEY SD RLAAYAVPY  V++ +D 
Sbjct:   315 KWLLQYRISPYFCKWGESMRVLTYTSPWPADHPKSDEYLSDSRLAAYAVPYRQVIAGDDS 374

Query:   382 AKDYVRKEIELLRTKAHWKKAYFYLWDEPLNMEHYSSVRNMASELHAYAPDARVLTTYYC 441
              + Y+RKE+E+LR+K HW KAYFYLWDEPLNMEH+ +VR MASE++AYAPD+RVLTTYYC
Sbjct:   375 RESYLRKEVEILRSKPHWNKAYFYLWDEPLNMEHFDNVRKMASEIYAYAPDSRVLTTYYC 434

Query:   442 GPSDAPLGPTPFESFVKVPKFLRPHTQIYCTSEWVLGNREDLVKDIVTELQPENGE 497
             GP DAPL PTPFESFVKVP  LRP+TQIYCTSEWVLGNREDLVKDI+ ELQ ENGE
Sbjct:   435 GPGDAPLAPTPFESFVKVPNLLRPYTQIYCTSEWVLGNREDLVKDILDELQTENGE 490


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.135   0.430    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      526       514   0.00088  119 3  11 22  0.40    34
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  1
  No. of states in DFA:  627 (67 KB)
  Total size of DFA:  357 KB (2176 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  42.56u 0.21s 42.77t   Elapsed:  00:00:02
  Total cpu time:  42.56u 0.21s 42.77t   Elapsed:  00:00:02
  Start:  Sat May 11 02:36:35 2013   End:  Sat May 11 02:36:37 2013


GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
GO:0045892 "negative regulation of transcription, DNA-dependent" evidence=RCA

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 526
PF1063378 NPCBM_assoc: NPCBM-associated, NEW3 domain of alph 95.04
PF15418132 DUF4625: Domain of unknown function (DUF4625) 92.4
COG1470513 Predicted membrane protein [Function unknown] 89.7
PF01229 486 Glyco_hydro_39: Glycosyl hydrolases family 39; Int 87.74
PF00150281 Cellulase: Cellulase (glycosyl hydrolase family 5) 82.46
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known Back     alignment and domain information
Probab=95.04  E-value=0.14  Score=42.12  Aligned_cols=75  Identities=23%  Similarity=0.344  Sum_probs=43.0

Q ss_pred             ecCceEEEEEEEccCcccCCCCCCcceEEEEcccCCCCCCcccccCceEEEEeeecCCCCcccccCCCCcceeeecCCCe
Q 009770           81 ARNERESVQIALRPKVSWSSSSTAGVVQVQCSDLCSASGDRLVVGQSLMLRRVVPMLGVPDALVPLDLPVCQISLIPGET  160 (526)
Q Consensus        81 aRGErvSFQlVLrs~~~~~s~~~l~~V~V~vSDL~S~sG~~~i~g~~Itlr~V~yVLGyPD~LVP~d~p~~~v~V~agqt  160 (526)
                      -.||...+.+-+...    ....+.+++++++   .++|                   |-   +..+ +.....|++|++
T Consensus         2 ~~G~~~~~~~tv~N~----g~~~~~~v~~~l~---~P~G-------------------W~---~~~~-~~~~~~l~pG~s   51 (78)
T PF10633_consen    2 TPGETVTVTLTVTNT----GTAPLTNVSLSLS---LPEG-------------------WT---VSAS-PASVPSLPPGES   51 (78)
T ss_dssp             -TTEEEEEEEEEE------SSS-BSS-EEEEE-----TT-------------------SE------E-EEEE--B-TTSE
T ss_pred             CCCCEEEEEEEEEEC----CCCceeeEEEEEe---CCCC-------------------cc---ccCC-ccccccCCCCCE
Confidence            358888888888753    1234456666554   2333                   11   0011 112237899999


Q ss_pred             eEEEEEEEcCCCCCCceeEEEEEEE
Q 009770          161 TAVWVSIDAPYAQPPGLYEGEIIIT  185 (526)
Q Consensus       161 QpLWIdV~VP~dA~PG~Y~GtVtVt  185 (526)
                      +.+=++|.+|++++||.|..+++++
T Consensus        52 ~~~~~~V~vp~~a~~G~y~v~~~a~   76 (78)
T PF10633_consen   52 VTVTFTVTVPADAAPGTYTVTVTAR   76 (78)
T ss_dssp             EEEEEEEEE-TT--SEEEEEEEEEE
T ss_pred             EEEEEEEECCCCCCCceEEEEEEEE
Confidence            9999999999999999999999886



It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.

>PF15418 DUF4625: Domain of unknown function (DUF4625) Back     alignment and domain information
>COG1470 Predicted membrane protein [Function unknown] Back     alignment and domain information
>PF01229 Glyco_hydro_39: Glycosyl hydrolases family 39; InterPro: IPR000514 O-Glycosyl hydrolases 3 Back     alignment and domain information
>PF00150 Cellulase: Cellulase (glycosyl hydrolase family 5); InterPro: IPR001547 O-Glycosyl hydrolases 3 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query526
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 7e-08
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 4e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 54.5 bits (130), Expect = 7e-08
 Identities = 77/493 (15%), Positives = 137/493 (27%), Gaps = 150/493 (30%)

Query: 79  LAA---RNERESVQIALRPKVSW---SSSSTAGVVQVQCSDLCSASGDRLVVGQSLM--- 129
           +A     + +  VQ  +  K+ W    + ++   V      L                  
Sbjct: 166 VALDVCLSYK--VQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNI 223

Query: 130 ----------LRRVVPMLGVPDALVPLDLPV------------CQISLIPGETT---AVW 164
                     LRR++      + L+ L   V            C+I L+   TT    V 
Sbjct: 224 KLRIHSIQAELRRLLKSKPYENCLLVLL-NVQNAKAWNAFNLSCKI-LL---TTRFKQVT 278

Query: 165 VSIDAPYAQPPGLYEGEIIITSKADTELSSQCLGK--------------------GEKHR 204
             + A       L    + +T      L  + L                       E  R
Sbjct: 279 DFLSAATTTHISLDHHSMTLTPDEVKSLLLKYLDCRPQDLPREVLTTNPRRLSIIAESIR 338

Query: 205 LFMELRNCLDNVEPIEGKPLHEVVERAKSTATTLRRVIFSPLFSEF--FSDNGPI----- 257
              +     DN + +    L  ++E   S+   L    +  +F     F  +  I     
Sbjct: 339 ---DGLATWDNWKHVNCDKLTTIIE---SSLNVLEPAEYRKMFDRLSVFPPSAHIPTILL 392

Query: 258 -----DMMDEDAISNLSVRVKLSLTVWDFILPATPSLPAVIGISDTVIEDRFGVRHGSDE 312
                D++  D +  ++   K SL              + I I    +E +       +E
Sbjct: 393 SLIWFDVIKSDVMVVVNKLHKYSLVEKQ-------PKESTISIPSIYLELK---VKLENE 442

Query: 313 WYEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPY 372
              AL  H   +  Y I   F                    P  D+YF           Y
Sbjct: 443 --YAL--HRSIVDHYNIPKTFDSDDLIP-------------PYLDQYF-----------Y 474

Query: 373 SPVLSSNDGAKDYVRKEIELLRTKAHWKKAYF-YLWDEPLNMEHYSSVRNMASELHAYAP 431
           S +     G   +  K IE       ++  +  + + E   + H S+  N +  +     
Sbjct: 475 SHI-----G---HHLKNIEHPERMTLFRMVFLDFRFLEQ-KIRHDSTAWNASGSI---LN 522

Query: 432 DARVLTTY--YCGPSDAPLGPTPFESFVK-VPKFL-RPHTQIYCTSEWVLGNREDLVKDI 487
             + L  Y  Y   +D    P  +E  V  +  FL +    + C+             D+
Sbjct: 523 TLQQLKFYKPYICDND----PK-YERLVNAILDFLPKIEENLICSKY----------TDL 567

Query: 488 V-TELQPENGEVI 499
           +   L  E+  + 
Sbjct: 568 LRIALMAEDEAIF 580


>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query526
4ekj_A 500 Beta-xylosidase; TIM-barrel fold, hemicellulase, h 93.1
3qr3_A340 Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A 85.73
>4ekj_A Beta-xylosidase; TIM-barrel fold, hemicellulase, hydrolase; 2.50A {Caulobacter vibrioides} Back     alignment and structure
Probab=93.10  E-value=0.17  Score=51.94  Aligned_cols=112  Identities=22%  Similarity=0.293  Sum_probs=70.9

Q ss_pred             HHHHHHHHHHHHhCCccCCCcCCCCceeEEeecCCCCCCCCcccccccccccceeeecCCCCCCChHHHHHHHHHHHHHH
Q 009770          314 YEALDQHFKWLLQYRISPFFCRWGESMRVLTYTCPWPADHPKSDEYFSDPRLAAYAVPYSPVLSSNDGAKDYVRKEIELL  393 (526)
Q Consensus       314 ~~aL~~~~~~ll~~risp~f~~Wg~~mrv~~y~~pWp~dh~~sd~y~sd~~l~aY~vP~~~~~~g~~a~~~~lk~~~ehL  393 (526)
                      |..+|+.++++++++|.|++.        +.+.-+|....+....|..        -+..  ....+.+.++++++++|+
T Consensus        83 ~~~~D~~~d~~~~~G~~p~~~--------l~~~P~~~~~~~~~~~~~~--------~~~~--~~~~~~w~~~~~~~~~~~  144 (500)
T 4ekj_A           83 WTKIDQLYDALLAKGIKPFIE--------LGFTPEAMKTSDQTIFYWK--------GNTS--HPKLGPWRDLIDAFVHHL  144 (500)
T ss_dssp             CHHHHHHHHHHHHTTCEEEEE--------ECCBCGGGCSSCCEETTTT--------EECS--CCCHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHCCCEEEEE--------EeCCchhhcCCCCcccccc--------CCCC--cccHHHHHHHHHHHHHHH
Confidence            457899999999999998833        2333333322221110000        0111  124567889999999999


Q ss_pred             Hhc-C--chhHHhhhhcCCCCC--------cccH-HHHHHHHHHHHHhCCCCcEEEEEeeCC
Q 009770          394 RTK-A--HWKKAYFYLWDEPLN--------MEHY-SSVRNMASELHAYAPDARVLTTYYCGP  443 (526)
Q Consensus       394 r~K-g--w~~kay~Yl~DEP~~--------~e~y-~~~r~~a~~ir~~aPd~riLtT~~~gp  443 (526)
                      ..+ |  .-+..|+=+|.||..        ++.| +.++.+++.||+++|+++|.....|++
T Consensus       145 ~~RYg~~~v~~w~~EvwNEp~~~~~~~~~~~~~y~~l~~~~~~aik~~~P~~~Vgg~~~~~~  206 (500)
T 4ekj_A          145 RARYGVEEVRTWFFEVWNEPNLDGFWEKADQAAYFELYDVTARAIKAIDPSLRVGGPATAGA  206 (500)
T ss_dssp             HHHHCHHHHHTSEEEESSCTTSTTTSGGGCHHHHHHHHHHHHHHHHHHCTTSEEEEEEEGGG
T ss_pred             HHhhCccccceeEEEEEECCCCccCCCCCCHHHHHHHHHHHHHHHHhhCCccccccCccCCc
Confidence            764 2  234445558999853        3444 356677899999999999987666554



>3qr3_A Endoglucanase EG-II; TIM barrel, hydrolase; 2.05A {Hypocrea jecorina} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query526
d1w8oa1103 Sialidase, "linker" domain {Micromonospora viridif 94.22
d1uhva2346 Beta-D-xylosidase, catalytic domain {Thermoanaerob 92.83
d1ceoa_340 Endoglucanase CelC {Clostridium thermocellum [TaxI 87.03
d1vjza_325 Endoglucanase homologue TM1752 {Thermotoga maritim 82.79
>d1w8oa1 b.1.18.2 (A:403-505) Sialidase, "linker" domain {Micromonospora viridifaciens [TaxId: 1881]} Back     information, alignment and structure
class: All beta proteins
fold: Immunoglobulin-like beta-sandwich
superfamily: E set domains
family: E-set domains of sugar-utilizing enzymes
domain: Sialidase, "linker" domain
species: Micromonospora viridifaciens [TaxId: 1881]
Probab=94.22  E-value=0.065  Score=43.94  Aligned_cols=35  Identities=26%  Similarity=0.433  Sum_probs=32.7

Q ss_pred             eeeecCCCeeEEEEEEEcCCCCCCceeEEEEEEEe
Q 009770          152 QISLIPGETTAVWVSIDAPYAQPPGLYEGEIIITS  186 (526)
Q Consensus       152 ~v~V~agqtQpLWIdV~VP~dA~PG~Y~GtVtVt~  186 (526)
                      ...|.||+.+.+=++|.+|.++.||.|..+++++.
T Consensus        52 ~~~L~pG~s~~~~~~Vt~p~~a~~G~Y~i~~~a~~   86 (103)
T d1w8oa1          52 VEPLMPGRQAKGQVTITVPAGTTPGRYRVGATLRT   86 (103)
T ss_dssp             ECCBCTTCEEEEEEEEECCTTCCCEEEEEEEEEEE
T ss_pred             ceeeCCCCcEEEEEEEECCCCCCCceEEEEEEEEe
Confidence            45689999999999999999999999999999987



>d1uhva2 c.1.8.3 (A:14-359) Beta-D-xylosidase, catalytic domain {Thermoanaerobacterium saccharolyticum [TaxId: 28896]} Back     information, alignment and structure
>d1ceoa_ c.1.8.3 (A:) Endoglucanase CelC {Clostridium thermocellum [TaxId: 1515]} Back     information, alignment and structure
>d1vjza_ c.1.8.3 (A:) Endoglucanase homologue TM1752 {Thermotoga maritima [TaxId: 2336]} Back     information, alignment and structure