Citrus Sinensis ID: 013500


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440--
MSSYLSGGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNSPLAISTRKARTPRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNNSSGTGNSNRKGSSEKRKKKVGKEKPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSNAQDERGKKKVVMLSDL
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHccccHHHHHHcHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHcccccccccHHHHHHcccccccccccccccccccccccccccccccccc
ccccccccccEccccHHHEccccccccccccccccccccccccccccEEcccccccccccccHHHHHHHHHHHHccccccccccccccccccccccccccccHccccHHHcccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHcccHHcccEEEEcccccccccccccccccccccccEEEccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccHHHccccccHEEEEccccccccccccccccccEEEEEcHHHHHHHHcccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccEEEEcccccHHHHccccHEEEcccc
mssylsgggrtyaLDLEiikspshstgtshtsssplstlseasnsplaistrkartprkrpnqtYNEAAALLStacpnifstkhltnlnkfttkppdhrrhhqqqepnqqffdesselllpfrviddsgfllhqpildkprfplepkvtkfcenklscqnsgevdfqganhnpmevqissefdedfdaesiildEEFEEGIDSIMgnlsvkkdaiesddghsqvhnsncwygnpmgigfggklelgfgmrSGMKALRQVdegnwwnfpidvlqispkfnnssgtgnsnrkgssekrkkkvgkekpvaeskvselpakeespipkansgellLKLQYDDVlnawsgkgspfldeslapdgqgndvSARLAQIDlfsdgglreaSVLRYKEKRRTRLFSKKIRYQVRKVnadrrprmkgrfvrrpnsnaqdergkkKVVMLSDL
mssylsggGRTYALDLEIIKSPshstgtshtsssplstlseasnsplaistrkartprkrpnQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQIspkfnnssgtgnsnrkgssekrkkkvgkekpvaeskvselpakeespipkansgeLLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIdlfsdgglreasvlrykekrrtrlfskkiryqvrkvnadrrprmkgrfvrrpnsnaqdergkkkvvmlsdl
MSSYLSGGGRTYALDLEIIKspshstgtshtsssplstlsEASNSPLAISTRKARTPRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNNSSGTGNSNrkgssekrkkkvgkekPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSNAQDERGKKKVVMLSDL
************AL******************************************************AALLSTACPNIFSTKHLTNL****************************ELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSC*****************************AESIILDEEFEEGIDSIMGNL***************VHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQIS******************************************************LLLKLQYDDVLNAWSGK********************RLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQV**************************************
******GGGRTYALDL*****************************************************ALLSTACP************************************ESSELLLPFRVIDDSGFLLHQPILDK***************************************SSEFDEDFDAESIILDEEFEEGIDSI******************************************************VDEGNWWNFPIDVLQISPK****************************************************************************************************************K*R*RLFSKKIRYQVRKVNADRRPRMKGRF***********************
********GRTYALDLEIIKSP****************************************QTYNEAAALLSTACPNIFSTKHLTNLNKFTT**************NQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAI********VHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNN*****************************************IPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFV**********************
**********TYALD*EIIK*************************************RKRPNQTYNEAAALLSTACPNIFSTKH****************************DESSELLLPFRVIDDSGFLLHQP****************************************VQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNN*************************VAESKVS*************NSGELLLKLQYDDVLNAWSGKGSPFLDE*LA***********L**********LREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPN******************
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSSYLSGGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNSPLAISTRKARTPRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNNSSGTGNSNRKGSSEKRKKKVGKEKPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSNAQDERGKKKVVMLSDL
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query442 2.2.26 [Sep-21-2011]
Q9LU68435 Protein CHLOROPLAST IMPOR no no 0.868 0.882 0.415 2e-64
Q8RWD0417 Zinc finger protein CONST no no 0.203 0.215 0.473 3e-18
Q8LG76406 Zinc finger protein CONST no no 0.210 0.229 0.459 3e-18
Q9C9A9392 Zinc finger protein CONST no no 0.208 0.234 0.473 9e-17
Q9M9B3319 Zinc finger protein CONST no no 0.187 0.260 0.5 8e-15
Q9FDX8395 Zinc finger protein HD1 O no no 0.158 0.177 0.471 3e-09
Q9FHH8355 Zinc finger protein CONST no no 0.176 0.219 0.418 4e-09
Q9SK53294 Zinc finger protein CONST no no 0.113 0.170 0.58 7e-09
Q940T9362 Zinc finger protein CONST no no 0.115 0.140 0.529 7e-09
Q96502347 Zinc finger protein CONST no no 0.097 0.123 0.627 2e-08
>sp|Q9LU68|CIA2_ARATH Protein CHLOROPLAST IMPORT APPARATUS 2 OS=Arabidopsis thaliana GN=CIA2 PE=2 SV=1 Back     alignment and function desciption
 Score =  246 bits (629), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 194/467 (41%), Positives = 252/467 (53%), Gaps = 83/467 (17%)

Query: 1   MSSYLS---GGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNS-PLAISTRKART 56
           MS+ LS   GG   Y+ +LE +KSP  S+ T+ T ++  S+    S++ PLAISTRK RT
Sbjct: 1   MSACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRT 60

Query: 57  PRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFF---- 112
            RKRPNQTYNEAA LLSTA PNIFS+    NL+          +       N  F+    
Sbjct: 61  QRKRPNQTYNEAATLLSTAYPNIFSS----NLSS---------KQKTHSSSNSHFYGPLL 107

Query: 113 ---DESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGA 169
              D++S+LLLP+  I++  FL H  I  K  F        F + K    NSG   + G 
Sbjct: 108 SDNDDASDLLLPYESIEEPDFLFHPTIQTKTEF--------FSDQKEV--NSGGDCYGG- 156

Query: 170 NHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNL----------SVKKDAIESDD 219
                E++   +F ++FDAESI LDE+ EEGIDSIMG +            +   + +  
Sbjct: 157 -----EIE-KFDFSDEFDAESI-LDEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRG 209

Query: 220 GHS---------QVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNF-PI 269
           G S         Q+   N W  +  G  F     LG G+RS   ALR+ D+   W    +
Sbjct: 210 GRSSSNRIGKLEQMMMINSWNRSSNGFNF----PLGLGLRS---ALRENDDTKLWKIHTV 262

Query: 270 DVLQISPKFNN--SSGTGNSNRKGSSEKRKKKVGKEK----------PVAESKVSELPAK 317
           D  QISP+     +    ++  +  S+ +K  +  EK           V  + ++E  + 
Sbjct: 263 DFEQISPRIQTVKTETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSL 322

Query: 318 EESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDG 377
           E++          LLKL YD VL AWS K SPF DE      +  DV+ARLAQIDLF D 
Sbjct: 323 EDTEETSLKRTGPLLKLDYDGVLEAWSDKTSPFPDE--IQGSEAVDVNARLAQIDLFGDS 380

Query: 378 GLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPN 424
           G+REASVLRYKEKRRTRLFSKKIRYQVRK+NAD+RPRMKGRFVRRPN
Sbjct: 381 GMREASVLRYKEKRRTRLFSKKIRYQVRKLNADQRPRMKGRFVRRPN 427




Responsible for specific up-regulation of the translocon genes TOC33 and TOC75 in leaves. Involved in the general chloroplast protein import pathway regulation, including protein import and protein translation efficiencies.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8RWD0|COL16_ARATH Zinc finger protein CONSTANS-LIKE 16 OS=Arabidopsis thaliana GN=COL16 PE=2 SV=2 Back     alignment and function description
>sp|Q8LG76|COL6_ARATH Zinc finger protein CONSTANS-LIKE 6 OS=Arabidopsis thaliana GN=COL6 PE=2 SV=2 Back     alignment and function description
>sp|Q9C9A9|COL7_ARATH Zinc finger protein CONSTANS-LIKE 7 OS=Arabidopsis thaliana GN=COL7 PE=2 SV=1 Back     alignment and function description
>sp|Q9M9B3|COL8_ARATH Zinc finger protein CONSTANS-LIKE 8 OS=Arabidopsis thaliana GN=COL8 PE=2 SV=2 Back     alignment and function description
>sp|Q9FDX8|HD1_ORYSJ Zinc finger protein HD1 OS=Oryza sativa subsp. japonica GN=HD1 PE=2 SV=1 Back     alignment and function description
>sp|Q9FHH8|COL5_ARATH Zinc finger protein CONSTANS-LIKE 5 OS=Arabidopsis thaliana GN=COL5 PE=2 SV=2 Back     alignment and function description
>sp|Q9SK53|COL3_ARATH Zinc finger protein CONSTANS-LIKE 3 OS=Arabidopsis thaliana GN=COL3 PE=1 SV=1 Back     alignment and function description
>sp|Q940T9|COL4_ARATH Zinc finger protein CONSTANS-LIKE 4 OS=Arabidopsis thaliana GN=COL4 PE=2 SV=2 Back     alignment and function description
>sp|Q96502|COL2_ARATH Zinc finger protein CONSTANS-LIKE 2 OS=Arabidopsis thaliana GN=COL2 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query442
224087284448 predicted protein [Populus trichocarpa] 0.936 0.924 0.567 1e-109
147784441392 hypothetical protein VITISV_015092 [Viti 0.877 0.989 0.588 1e-106
225470656392 PREDICTED: protein CHLOROPLAST IMPORT AP 0.877 0.989 0.590 1e-105
449463579430 PREDICTED: protein CHLOROPLAST IMPORT AP 0.893 0.918 0.554 1e-100
296090276392 unnamed protein product [Vitis vinifera] 0.821 0.926 0.570 2e-97
255554380397 CIL, putative [Ricinus communis] gi|2235 0.864 0.962 0.587 7e-90
356549250399 PREDICTED: protein CHLOROPLAST IMPORT AP 0.859 0.952 0.534 4e-85
356562078398 PREDICTED: protein CHLOROPLAST IMPORT AP 0.873 0.969 0.538 2e-82
356549252393 PREDICTED: protein CHLOROPLAST IMPORT AP 0.839 0.944 0.526 2e-80
449532521380 PREDICTED: protein CHLOROPLAST IMPORT AP 0.787 0.915 0.514 7e-76
>gi|224087284|ref|XP_002308112.1| predicted protein [Populus trichocarpa] gi|222854088|gb|EEE91635.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 265/467 (56%), Positives = 317/467 (67%), Gaps = 53/467 (11%)

Query: 2   SSYLSGGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNSPLAISTRKARTPRKRP 61
           S  +SGGGRTY  DLEI+KS S S+  +  +SSP ST+SE+SNSPLAISTRK+RTPRKRP
Sbjct: 3   SPCISGGGRTYGFDLEIVKSSSTSSTRTSHTSSPSSTISESSNSPLAISTRKSRTPRKRP 62

Query: 62  NQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLP 121
           NQTYNEAAALLSTA PNIFSTKHLT  +KF TKP D          N    D+SS+LLLP
Sbjct: 63  NQTYNEAAALLSTAYPNIFSTKHLTKSSKF-TKPQD----------NSLLLDQSSDLLLP 111

Query: 122 FRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSE 181
           FRV D+SGFL+HQPI  KP +  E +   F + K SCQ+SGEVDF G +     V++   
Sbjct: 112 FRVFDNSGFLIHQPIQGKPSYGNESRFANFTD-KSSCQSSGEVDFHGNS-----VELCDG 165

Query: 182 FDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAI-ESDDGHSQVHNS--NCWYGNPMGIG 238
            DEDFDAES ILDEEFEEGIDSIMG+ S+  + + E  +G S       N WYG+ MG  
Sbjct: 166 LDEDFDAES-ILDEEFEEGIDSIMGHSSIGNEMVDEVPNGISSSFGGQMNSWYGSSMGYN 224

Query: 239 FGGKLELGFG--MRSGMKALRQVDEGNWWNFPI-DVLQISPKF----------------- 278
           FGGK + G G  MR G++ALR VDEGNWW+FPI D+LQISP+                  
Sbjct: 225 FGGKSQYGHGIAMRRGVRALRHVDEGNWWDFPIVDMLQISPRLTTTVAATANANSNNGPE 284

Query: 279 -----NNSSGTGNSNRKGSSEKRKKKVGKEKPVAESKVSEL----PAKEESPIPKANSGE 329
                          +  SS ++KKK  ++  V E K  EL    P KE S IP+++ G 
Sbjct: 285 CNSIPKPKPKPKLKLKPNSSSEKKKKKVEKPAVMEEKNVELKDENPVKENS-IPQSSQG- 342

Query: 330 LLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKE 389
           L+LKL YD VL+ WS +GSPF DES+    +GNDVSARLAQIDLFS+ G+REASVLRYKE
Sbjct: 343 LILKLNYDHVLSEWSDRGSPFSDESMGC-AEGNDVSARLAQIDLFSENGMREASVLRYKE 401

Query: 390 KRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPNSNAQDERGKKKV 436
           KRRTRLFSKKIRYQVRKVNAD+RPRMKGRFVRRPN  + ++  KKK+
Sbjct: 402 KRRTRLFSKKIRYQVRKVNADQRPRMKGRFVRRPNPRSDEQEKKKKL 448




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225470656|ref|XP_002268213.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|449463579|ref|XP_004149511.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|296090276|emb|CBI40095.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255554380|ref|XP_002518229.1| CIL, putative [Ricinus communis] gi|223542576|gb|EEF44115.1| CIL, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|356549250|ref|XP_003543009.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|356562078|ref|XP_003549301.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Glycine max] Back     alignment and taxonomy information
>gi|356549252|ref|XP_003543010.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|449532521|ref|XP_004173229.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query442
TAIR|locus:2175564435 CIA2 "chloroplast import appar 0.918 0.933 0.387 7.3e-59
TAIR|locus:2120850409 CIL "AT4G25990" [Arabidopsis t 0.638 0.689 0.426 4.2e-47
TAIR|locus:2031180417 BBX15 "B-box domain protein 15 0.108 0.115 0.795 1.5e-17
TAIR|locus:2007482195 AT1G07050 "AT1G07050" [Arabido 0.203 0.461 0.516 8.9e-17
TAIR|locus:2201277406 BBX14 "B-box domain protein 14 0.104 0.113 0.808 3.8e-16
TAIR|locus:2145698339 AT5G14370 [Arabidopsis thalian 0.217 0.283 0.459 1.1e-15
TAIR|locus:2031581392 BBX16 "B-box domain protein 16 0.106 0.119 0.812 2.9e-15
TAIR|locus:2028421326 BBX17 "B-box domain protein 17 0.187 0.254 0.5 2.1e-13
TAIR|locus:2062749183 AT2G46670 "AT2G46670" [Arabido 0.253 0.612 0.319 3.3e-09
UNIPROTKB|Q9FDX8395 HD1 "Zinc finger protein HD1" 0.167 0.187 0.473 8.4e-08
TAIR|locus:2175564 CIA2 "chloroplast import apparatus 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 604 (217.7 bits), Expect = 7.3e-59, P = 7.3e-59
 Identities = 174/449 (38%), Positives = 228/449 (50%)

Query:     1 MSSYLSGGG---RTYALDLEIIKXXXXXXXXXXXXXXX-XXXXXEASNSPLAISTRKART 56
             MS+ LS GG     Y+ +LE +K                     E+SNSPLAISTRK RT
Sbjct:     1 MSACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRT 60

Query:    57 PRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQ-QEPNQQFFDES 115
              RKRPNQTYNEAA LLSTA PNIFS+    NL+   +K   H   +     P     D++
Sbjct:    61 QRKRPNQTYNEAATLLSTAYPNIFSS----NLS---SKQKTHSSSNSHFYGPLLSDNDDA 113

Query:   116 SELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVD-FQGANHNPM 174
             S+LLLP+  I++  FL H  I  K  F  + K      +   C   GE++ F  ++    
Sbjct:   114 SDLLLPYESIEEPDFLFHPTIQTKTEFFSDQKEVN---SGGDCYG-GEIEKFDFSDEFDA 169

Query:   175 EVQISSEFDEDFDA-ESIILDEEFEEGI-DS-IMGNLSVKKDAIESDDGH-SQVHNSNCW 230
             E  +  + +E  D+    +++     GI +S + G ++    +  +  G   Q+   N W
Sbjct:   170 ESILDEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRGGRSSSNRIGKLEQMMMINSW 229

Query:   231 YGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNF-PIDVLQISPKF----------- 278
               +  G  F     LG G+RS   ALR+ D+   W    +D  QISP+            
Sbjct:   230 NRSSNGFNF----PLGLGLRS---ALRENDDTKLWKIHTVDFEQISPRIQTVKTETAIST 282

Query:   279 -NNSSGTGNSNXXXXXXXXXXXXXXXXPVAESKVSELPAKEESPIPKANSGELLLKLQYD 337
              +     G                    V  + ++E  + E++          LLKL YD
Sbjct:   283 VDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLKLDYD 342

Query:   338 DVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFS 397
              VL AWS K SPF DE      +  DV+ARLAQIDLF D G+REASVLRYKEKRRTRLFS
Sbjct:   343 GVLEAWSDKTSPFPDE--IQGSEAVDVNARLAQIDLFGDSGMREASVLRYKEKRRTRLFS 400

Query:   398 KKIRYQVRKVNADRRPRMKGRFVRRPNSN 426
             KKIRYQVRK+NAD+RPRMKGRFVRRPN +
Sbjct:   401 KKIRYQVRKLNADQRPRMKGRFVRRPNES 429




GO:0005634 "nucleus" evidence=ISM;IDA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEP
GO:0045036 "protein targeting to chloroplast" evidence=RCA;IMP
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA
TAIR|locus:2120850 CIL "AT4G25990" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031180 BBX15 "B-box domain protein 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2007482 AT1G07050 "AT1G07050" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2201277 BBX14 "B-box domain protein 14" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2145698 AT5G14370 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031581 BBX16 "B-box domain protein 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2028421 BBX17 "B-box domain protein 17" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2062749 AT2G46670 "AT2G46670" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q9FDX8 HD1 "Zinc finger protein HD1" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00060622
hypothetical protein (449 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query442
pfam0620345 pfam06203, CCT, CCT motif 1e-21
>gnl|CDD|203407 pfam06203, CCT, CCT motif Back     alignment and domain information
 Score = 87.3 bits (217), Expect = 1e-21
 Identities = 29/44 (65%), Positives = 36/44 (81%)

Query: 380 REASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRP 423
           REA++LRYKEKR+TR F KKIRY  RK  A+ RPR+KGRFV++ 
Sbjct: 1   REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVKGRFVKQS 44


This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 442
PF0620345 CCT: CCT motif; InterPro: IPR010402 The CCT (CONST 99.74
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 96.43
PF0942527 CCT_2: Divergent CCT motif; InterPro: IPR018467 Th 93.08
>PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction Back     alignment and domain information
Probab=99.74  E-value=5.5e-19  Score=132.22  Aligned_cols=45  Identities=58%  Similarity=1.007  Sum_probs=43.2

Q ss_pred             HHHHHHHHHHHhhhccCCCcccchhhhcccccCCCcCcccccCCC
Q 013500          380 REASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKGRFVRRPN  424 (442)
Q Consensus       380 Rea~v~RYreKRk~R~f~KkIRY~~RK~~Ae~RPRvKGRFVk~~~  424 (442)
                      |+++|+||+|||++|+|+|+|+|+|||.+|+.|||||||||+.++
T Consensus         1 R~~~l~Ry~~Kr~~R~f~kkirY~~Rk~~A~~R~RvkGRFvk~~e   45 (45)
T PF06203_consen    1 REEKLQRYREKRKRRNFEKKIRYESRKAVADKRPRVKGRFVKKSE   45 (45)
T ss_pred             CHHHHHHHHHHHHhhcccccCCcchHHHHHhhCCccCCcccCCCC
Confidence            789999999999999999999999999999999999999999864



The CCT domain is found in association with other domains, such as the B-box zinc finger, the GATA-type zinc finger, the ZIM motif or the response regulatory domain. The CCT domain contains a putative nuclear localisation signal within the second half of the CCT motif and has been shown to be involved in nuclear localization and probably also has a role in protein-protein interaction [].; GO: 0005515 protein binding

>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query442
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 8e-05
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 44.1 bits (103), Expect = 8e-05
 Identities = 56/384 (14%), Positives = 101/384 (26%), Gaps = 107/384 (27%)

Query: 98  HRRHHQQQEPNQQFFDESSELLLPFR--VIDDSGFLLHQPILDKPRFPLEPKVTKFCENK 155
           H  HH   E  +  + +  ++L  F    +D+      + + D P+  L  +        
Sbjct: 2   HHHHHMDFETGEHQY-QYKDILSVFEDAFVDNFDC---KDVQDMPKSILSKEEIDHIIMS 57

Query: 156 --------------LSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGI 201
                         LS Q      F           + S    +    S++    + E  
Sbjct: 58  KDAVSGTLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMT-RMYIEQR 116

Query: 202 DSIM-GNLSVKKDAIESDDGHSQVHN------SNCW---YGNPMGIGFGGKLELGFGMRS 251
           D +   N    K  +     + ++                G  +  G  GK  +   +  
Sbjct: 117 DRLYNDNQVFAKYNVSRLQPYLKLRQALLELRPAKNVLIDG--VL-GS-GKTWVALDVCL 172

Query: 252 GMKALRQVDEGNWW-NF-----PIDVL--------QISPKFNN--SSGTGNSNRKGSSEK 295
             K   ++D   +W N      P  VL        QI P + +     +    R  S + 
Sbjct: 173 SYKVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQA 232

Query: 296 RKKKVGKEKPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGK----GSPFL 351
             +++ K KP                         LL L   +V NA +          L
Sbjct: 233 ELRRLLKSKPYENC---------------------LLVL--LNVQNAKAWNAFNLSCKIL 269

Query: 352 ----DESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSK-------KI 400
                + +  D      +  ++         L    V          L  K        +
Sbjct: 270 LTTRFKQVT-DFLSAATTTHISLDHH--SMTLTPDEVK--------SLLLKYLDCRPQDL 318

Query: 401 RYQVRKVNADRRPR---MKGRFVR 421
             +V   N    PR   +    +R
Sbjct: 319 PREVLTTN----PRRLSIIAESIR 338


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query442
3ogl_Q21 JAZ1 incomplete degron peptide; leucine-rich repea 89.99
3ogk_Q22 JAZ1 incomplete degron peptide; leucine rich repea 89.07
>3ogk_Q JAZ1 incomplete degron peptide; leucine rich repeat, ubiquitin ligase, SCF, protein binding; HET: OGK; 2.80A {Arabidopsis thaliana} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00