Citrus Sinensis ID: 014844


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------
MSSYLSGGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNSPLAISTRKARTPRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNNSSGTGNSNRKGSSEKRKKKVGKEKPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKV
ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcccccccccccccccccccccccccccccccccccccccHHHHHHHcccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHcccccccccHHHHHHcccccccccc
ccccccccccEccccHHHEccccccccccccccccccccccccccccEEcccccccccccccHHHHHHHHHHHHccccccccccccccccccccccccccccHHccccHccccccccccccccccccHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccHcccEEEEcccccccccccccccccccccccEEEcccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHcccHHHHcccccccEccccccHHHcccccccccccEEEEcHHHHHHHHHccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccc
mssylsgggrtyaLDLEiikspshstgtshtsssplstlseasnsplaistrkartprkrpnqtYNEAAALLStacpnifstkhltnlnkfttkppdhrrhhqqqepnqqffdesselllpfrviddsgfllhqpildkprfplepkvtkfcenklscqnsgevdfqganhnpmevqissefdedfdaesiildEEFEEGIDSIMgnlsvkkdaiesddghsqvhnsncwygnpmgigfggklelgfgmrSGMKALRQVdegnwwnfpidvlqispkfnnssgtgnsnrkgssekrkkkvgkekpvaeskvselpakeespipkansgellLKLQYDDVlnawsgkgspfldeslapdgqgndvSARLAQIDlfsdgglreaSVLRYKEKRRTRLFSKKIRYQVRKVnadrrprmkv
mssylsggGRTYALDLEIIKSPshstgtshtsssplstlseasnsplaistrkartprkrpnQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQIspkfnnssgtgnsnrkgssekrkkkvgkekpvaeskvselpakeespipkansgeLLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIdlfsdgglreasvlrykekrrtrlfskkiryqvrkvnadrrprmkv
MSSYLSGGGRTYALDLEIIKspshstgtshtsssplstlsEASNSPLAISTRKARTPRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNNSSGTGNSNrkgssekrkkkvgkekPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKV
************AL******************************************************AALLSTACPNIFSTKHLTNL****************************ELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSC*****************************AESIILDEEFEEGIDSIMGNL***************VHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQIS******************************************************LLLKLQYDDVLNAWSGK********************RLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQV*************
*****SGGGRTYALDL********************************************************ST***************************************ESSELLLPFRVIDDSGFLLHQPILDKPR***********************************QISSEFDEDFDAESIILDEEFEEGIDSIMG********************************FGGKLEL************QVDEGNWWNFPIDVLQ**************************************************************************************************************************FSKKIRYQVRKVNADRRPRM**
********GRTYALDLEIIKSP****************************************QTYNEAAALLSTACPNIFSTKHLTNLNKFTT**************NQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAI********VHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNN*****************************************IPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKV**********
**********TYALD*EIIK*************************************RKRPNQTYNEAAALLSTACPNIFSTKH*************************Q*FDESSELLLPFRVIDDSGFLLHQP****************************************VQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFN***********************************************SGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQI*****GGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRR*****
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MSSYLSGGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNSPLAISTRKARTPRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAIESDDGHSQVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNFPIDVLQISPKFNNSSGTGNSNRKGSSEKRKKKVGKEKPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKV
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query417 2.2.26 [Sep-21-2011]
Q9LU68435 Protein CHLOROPLAST IMPOR no no 0.901 0.864 0.405 5e-59
Q8RWD0417 Zinc finger protein CONST no no 0.201 0.201 0.453 8e-15
Q8LG76406 Zinc finger protein CONST no no 0.208 0.214 0.438 8e-15
Q9C9A9392 Zinc finger protein CONST no no 0.206 0.219 0.452 2e-13
Q9M9B3319 Zinc finger protein CONST no no 0.184 0.241 0.465 4e-11
Q9FDX8395 Zinc finger protein HD1 O no no 0.148 0.156 0.467 2e-06
Q9FHH8355 Zinc finger protein CONST no no 0.155 0.183 0.424 5e-06
Q96502347 Zinc finger protein CONST no no 0.088 0.106 0.621 2e-05
Q9SK53294 Zinc finger protein CONST no no 0.088 0.125 0.621 3e-05
Q39057373 Zinc finger protein CONST no no 0.086 0.096 0.638 3e-05
>sp|Q9LU68|CIA2_ARATH Protein CHLOROPLAST IMPORT APPARATUS 2 OS=Arabidopsis thaliana GN=CIA2 PE=2 SV=1 Back     alignment and function desciption
 Score =  228 bits (582), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 186/459 (40%), Positives = 244/459 (53%), Gaps = 83/459 (18%)

Query: 1   MSSYLS---GGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNS-PLAISTRKART 56
           MS+ LS   GG   Y+ +LE +KSP  S+ T+ T ++  S+    S++ PLAISTRK RT
Sbjct: 1   MSACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRT 60

Query: 57  PRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFF---- 112
            RKRPNQTYNEAA LLSTA PNIFS+    NL+          +       N  F+    
Sbjct: 61  QRKRPNQTYNEAATLLSTAYPNIFSS----NLSS---------KQKTHSSSNSHFYGPLL 107

Query: 113 ---DESSELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGA 169
              D++S+LLLP+  I++  FL H  I  K  F        F + K    NSG   + G 
Sbjct: 108 SDNDDASDLLLPYESIEEPDFLFHPTIQTKTEF--------FSDQKEV--NSGGDCYGG- 156

Query: 170 NHNPMEVQISSEFDEDFDAESIILDEEFEEGIDSIMGNL----------SVKKDAIESDD 219
                E++   +F ++FDAESI LDE+ EEGIDSIMG +            +   + +  
Sbjct: 157 -----EIE-KFDFSDEFDAESI-LDEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRG 209

Query: 220 GHS---------QVHNSNCWYGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNF-PI 269
           G S         Q+   N W  +  G  F     LG G+RS   ALR+ D+   W    +
Sbjct: 210 GRSSSNRIGKLEQMMMINSWNRSSNGFNF----PLGLGLRS---ALRENDDTKLWKIHTV 262

Query: 270 DVLQISPKFNN--SSGTGNSNRKGSSEKRKKKVGKEK----------PVAESKVSELPAK 317
           D  QISP+     +    ++  +  S+ +K  +  EK           V  + ++E  + 
Sbjct: 263 DFEQISPRIQTVKTETAISTVDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSL 322

Query: 318 EESPIPKANSGELLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDG 377
           E++          LLKL YD VL AWS K SPF DE      +  DV+ARLAQIDLF D 
Sbjct: 323 EDTEETSLKRTGPLLKLDYDGVLEAWSDKTSPFPDE--IQGSEAVDVNARLAQIDLFGDS 380

Query: 378 GLREASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMK 416
           G+REASVLRYKEKRRTRLFSKKIRYQVRK+NAD+RPRMK
Sbjct: 381 GMREASVLRYKEKRRTRLFSKKIRYQVRKLNADQRPRMK 419




Responsible for specific up-regulation of the translocon genes TOC33 and TOC75 in leaves. Involved in the general chloroplast protein import pathway regulation, including protein import and protein translation efficiencies.
Arabidopsis thaliana (taxid: 3702)
>sp|Q8RWD0|COL16_ARATH Zinc finger protein CONSTANS-LIKE 16 OS=Arabidopsis thaliana GN=COL16 PE=2 SV=2 Back     alignment and function description
>sp|Q8LG76|COL6_ARATH Zinc finger protein CONSTANS-LIKE 6 OS=Arabidopsis thaliana GN=COL6 PE=2 SV=2 Back     alignment and function description
>sp|Q9C9A9|COL7_ARATH Zinc finger protein CONSTANS-LIKE 7 OS=Arabidopsis thaliana GN=COL7 PE=2 SV=1 Back     alignment and function description
>sp|Q9M9B3|COL8_ARATH Zinc finger protein CONSTANS-LIKE 8 OS=Arabidopsis thaliana GN=COL8 PE=2 SV=2 Back     alignment and function description
>sp|Q9FDX8|HD1_ORYSJ Zinc finger protein HD1 OS=Oryza sativa subsp. japonica GN=HD1 PE=2 SV=1 Back     alignment and function description
>sp|Q9FHH8|COL5_ARATH Zinc finger protein CONSTANS-LIKE 5 OS=Arabidopsis thaliana GN=COL5 PE=2 SV=2 Back     alignment and function description
>sp|Q96502|COL2_ARATH Zinc finger protein CONSTANS-LIKE 2 OS=Arabidopsis thaliana GN=COL2 PE=1 SV=1 Back     alignment and function description
>sp|Q9SK53|COL3_ARATH Zinc finger protein CONSTANS-LIKE 3 OS=Arabidopsis thaliana GN=COL3 PE=1 SV=1 Back     alignment and function description
>sp|Q39057|CONS_ARATH Zinc finger protein CONSTANS OS=Arabidopsis thaliana GN=CO PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query417
224087284448 predicted protein [Populus trichocarpa] 0.944 0.879 0.568 1e-102
449463579430 PREDICTED: protein CHLOROPLAST IMPORT AP 0.949 0.920 0.555 1e-101
147784441392 hypothetical protein VITISV_015092 [Viti 0.894 0.951 0.583 2e-99
225470656392 PREDICTED: protein CHLOROPLAST IMPORT AP 0.894 0.951 0.585 2e-98
296090276392 unnamed protein product [Vitis vinifera] 0.834 0.887 0.564 2e-90
255554380397 CIL, putative [Ricinus communis] gi|2235 0.908 0.954 0.589 7e-90
356549252393 PREDICTED: protein CHLOROPLAST IMPORT AP 0.887 0.941 0.527 1e-80
356549250399 PREDICTED: protein CHLOROPLAST IMPORT AP 0.884 0.924 0.526 4e-80
356562078398 PREDICTED: protein CHLOROPLAST IMPORT AP 0.889 0.932 0.536 1e-77
449532521380 PREDICTED: protein CHLOROPLAST IMPORT AP 0.834 0.915 0.514 6e-76
>gi|224087284|ref|XP_002308112.1| predicted protein [Populus trichocarpa] gi|222854088|gb|EEE91635.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 254/447 (56%), Positives = 302/447 (67%), Gaps = 53/447 (11%)

Query: 2   SSYLSGGGRTYALDLEIIKSPSHSTGTSHTSSSPLSTLSEASNSPLAISTRKARTPRKRP 61
           S  +SGGGRTY  DLEI+KS S S+  +  +SSP ST+SE+SNSPLAISTRK+RTPRKRP
Sbjct: 3   SPCISGGGRTYGFDLEIVKSSSTSSTRTSHTSSPSSTISESSNSPLAISTRKSRTPRKRP 62

Query: 62  NQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQQEPNQQFFDESSELLLP 121
           NQTYNEAAALLSTA PNIFSTKHLT  +KF TKP D          N    D+SS+LLLP
Sbjct: 63  NQTYNEAAALLSTAYPNIFSTKHLTKSSKF-TKPQD----------NSLLLDQSSDLLLP 111

Query: 122 FRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVDFQGANHNPMEVQISSE 181
           FRV D+SGFL+HQPI  KP +  E +   F + K SCQ+SGEVDF G +     V++   
Sbjct: 112 FRVFDNSGFLIHQPIQGKPSYGNESRFANFTD-KSSCQSSGEVDFHGNS-----VELCDG 165

Query: 182 FDEDFDAESIILDEEFEEGIDSIMGNLSVKKDAI-ESDDGHSQVHNS--NCWYGNPMGIG 238
            DEDFDAES ILDEEFEEGIDSIMG+ S+  + + E  +G S       N WYG+ MG  
Sbjct: 166 LDEDFDAES-ILDEEFEEGIDSIMGHSSIGNEMVDEVPNGISSSFGGQMNSWYGSSMGYN 224

Query: 239 FGGKLELGFG--MRSGMKALRQVDEGNWWNFPI-DVLQISPKF----------------- 278
           FGGK + G G  MR G++ALR VDEGNWW+FPI D+LQISP+                  
Sbjct: 225 FGGKSQYGHGIAMRRGVRALRHVDEGNWWDFPIVDMLQISPRLTTTVAATANANSNNGPE 284

Query: 279 -----NNSSGTGNSNRKGSSEKRKKKVGKEKPVAESKVSEL----PAKEESPIPKANSGE 329
                          +  SS ++KKK  ++  V E K  EL    P KE S IP+++ G 
Sbjct: 285 CNSIPKPKPKPKLKLKPNSSSEKKKKKVEKPAVMEEKNVELKDENPVKENS-IPQSSQG- 342

Query: 330 LLLKLQYDDVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKE 389
           L+LKL YD VL+ WS +GSPF DES+    +GNDVSARLAQIDLFS+ G+REASVLRYKE
Sbjct: 343 LILKLNYDHVLSEWSDRGSPFSDESMGC-AEGNDVSARLAQIDLFSENGMREASVLRYKE 401

Query: 390 KRRTRLFSKKIRYQVRKVNADRRPRMK 416
           KRRTRLFSKKIRYQVRKVNAD+RPRMK
Sbjct: 402 KRRTRLFSKKIRYQVRKVNADQRPRMK 428




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|449463579|ref|XP_004149511.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucumis sativus] Back     alignment and taxonomy information
>gi|147784441|emb|CAN72725.1| hypothetical protein VITISV_015092 [Vitis vinifera] Back     alignment and taxonomy information
>gi|225470656|ref|XP_002268213.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Vitis vinifera] Back     alignment and taxonomy information
>gi|296090276|emb|CBI40095.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|255554380|ref|XP_002518229.1| CIL, putative [Ricinus communis] gi|223542576|gb|EEF44115.1| CIL, putative [Ricinus communis] Back     alignment and taxonomy information
>gi|356549252|ref|XP_003543010.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like isoform 2 [Glycine max] Back     alignment and taxonomy information
>gi|356549250|ref|XP_003543009.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like isoform 1 [Glycine max] Back     alignment and taxonomy information
>gi|356562078|ref|XP_003549301.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Glycine max] Back     alignment and taxonomy information
>gi|449532521|ref|XP_004173229.1| PREDICTED: protein CHLOROPLAST IMPORT APPARATUS 2-like [Cucumis sativus] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query417
TAIR|locus:2175564435 CIA2 "chloroplast import appar 0.949 0.910 0.378 4.3e-54
TAIR|locus:2120850409 CIL "AT4G25990" [Arabidopsis t 0.657 0.669 0.434 1.1e-46
TAIR|locus:2031180417 BBX15 "B-box domain protein 15 0.100 0.100 0.790 2.8e-14
TAIR|locus:2007482195 AT1G07050 "AT1G07050" [Arabido 0.201 0.430 0.494 9.2e-14
TAIR|locus:2201277406 BBX14 "B-box domain protein 14 0.095 0.098 0.804 6.6e-13
TAIR|locus:2031581392 BBX16 "B-box domain protein 16 0.098 0.104 0.809 5.4e-12
TAIR|locus:2145698339 AT5G14370 [Arabidopsis thalian 0.208 0.256 0.441 6.3e-12
TAIR|locus:2028421326 BBX17 "B-box domain protein 17 0.184 0.236 0.465 1.1e-09
UNIPROTKB|Q9FDX8395 HD1 "Zinc finger protein HD1" 0.148 0.156 0.467 4.6e-05
UNIPROTKB|Q9FE92407 Hd1 "Heading day 1" [Oryza sat 0.148 0.152 0.467 4.9e-05
TAIR|locus:2175564 CIA2 "chloroplast import apparatus 2" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 559 (201.8 bits), Expect = 4.3e-54, P = 4.3e-54
 Identities = 166/439 (37%), Positives = 219/439 (49%)

Query:     1 MSSYLSGGG---RTYALDLEIIKXXXXXXXXXXXXXXX-XXXXXEASNSPLAISTRKART 56
             MS+ LS GG     Y+ +LE +K                     E+SNSPLAISTRK RT
Sbjct:     1 MSACLSSGGGGAAAYSFELEKVKSPPPSSSTTTTRATSPSSTISESSNSPLAISTRKPRT 60

Query:    57 PRKRPNQTYNEAAALLSTACPNIFSTKHLTNLNKFTTKPPDHRRHHQQ-QEPNQQFFDES 115
              RKRPNQTYNEAA LLSTA PNIFS+    NL+   +K   H   +     P     D++
Sbjct:    61 QRKRPNQTYNEAATLLSTAYPNIFSS----NLS---SKQKTHSSSNSHFYGPLLSDNDDA 113

Query:   116 SELLLPFRVIDDSGFLLHQPILDKPRFPLEPKVTKFCENKLSCQNSGEVD-FQGANHNPM 174
             S+LLLP+  I++  FL H  I  K  F  + K      +   C   GE++ F  ++    
Sbjct:   114 SDLLLPYESIEEPDFLFHPTIQTKTEFFSDQKEVN---SGGDCYG-GEIEKFDFSDEFDA 169

Query:   175 EVQISSEFDEDFDA-ESIILDEEFEEGI-DS-IMGNLSVKKDAIESDDGH-SQVHNSNCW 230
             E  +  + +E  D+    +++     GI +S + G ++    +  +  G   Q+   N W
Sbjct:   170 ESILDEDIEEGIDSIMGTVVESNSNSGIYESRVPGMINRGGRSSSNRIGKLEQMMMINSW 229

Query:   231 YGNPMGIGFGGKLELGFGMRSGMKALRQVDEGNWWNF-PIDVLQISPKF----------- 278
               +  G  F     LG G+RS   ALR+ D+   W    +D  QISP+            
Sbjct:   230 NRSSNGFNF----PLGLGLRS---ALRENDDTKLWKIHTVDFEQISPRIQTVKTETAIST 282

Query:   279 -NNSSGTGNSNXXXXXXXXXXXXXXXXPVAESKVSELPAKEESPIPKANSGELLLKLQYD 337
              +     G                    V  + ++E  + E++          LLKL YD
Sbjct:   283 VDEEKSDGKKVVISGEKSNKKKKKKKMTVTTTLITESKSLEDTEETSLKRTGPLLKLDYD 342

Query:   338 DVLNAWSGKGSPFLDESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFS 397
              VL AWS K SPF DE      +  DV+ARLAQIDLF D G+REASVLRYKEKRRTRLFS
Sbjct:   343 GVLEAWSDKTSPFPDE--IQGSEAVDVNARLAQIDLFGDSGMREASVLRYKEKRRTRLFS 400

Query:   398 KKIRYQVRKVNADRRPRMK 416
             KKIRYQVRK+NAD+RPRMK
Sbjct:   401 KKIRYQVRKLNADQRPRMK 419




GO:0005634 "nucleus" evidence=ISM;IDA
GO:0006355 "regulation of transcription, DNA-dependent" evidence=IEP
GO:0045036 "protein targeting to chloroplast" evidence=RCA;IMP
GO:0045893 "positive regulation of transcription, DNA-dependent" evidence=RCA
TAIR|locus:2120850 CIL "AT4G25990" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031180 BBX15 "B-box domain protein 15" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2007482 AT1G07050 "AT1G07050" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2201277 BBX14 "B-box domain protein 14" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2031581 BBX16 "B-box domain protein 16" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2145698 AT5G14370 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2028421 BBX17 "B-box domain protein 17" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|Q9FDX8 HD1 "Zinc finger protein HD1" [Oryza sativa Japonica Group (taxid:39947)] Back     alignment and assigned GO terms
UNIPROTKB|Q9FE92 Hd1 "Heading day 1" [Oryza sativa (taxid:4530)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Fail to connect to STRING server


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query417
pfam0620345 pfam06203, CCT, CCT motif 1e-17
>gnl|CDD|203407 pfam06203, CCT, CCT motif Back     alignment and domain information
 Score = 75.7 bits (187), Expect = 1e-17
 Identities = 25/37 (67%), Positives = 30/37 (81%)

Query: 380 REASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMK 416
           REA++LRYKEKR+TR F KKIRY  RK  A+ RPR+K
Sbjct: 1   REAALLRYKEKRKTRKFDKKIRYASRKAVAESRPRVK 37


This short motif is found in a number of plant proteins. It is rich in basic amino acids and has been called a CCT motif after Co, Col and Toc1. The CCT motif is about 45 amino acids long and contains a putative nuclear localisation signal within the second half of the CCT motif. Toc1 mutants have been identified in this region. Length = 45

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 417
PF0620345 CCT: CCT motif; InterPro: IPR010402 The CCT (CONST 99.55
KOG1601340 consensus GATA-4/5/6 transcription factors [Transc 96.99
PF0942527 CCT_2: Divergent CCT motif; InterPro: IPR018467 Th 94.21
>PF06203 CCT: CCT motif; InterPro: IPR010402 The CCT (CONSTANS, CO-like, and TOC1) domain is a highly conserved basic module of ~43 amino acids, which is found near the C terminus of plant proteins often involved in light signal transduction Back     alignment and domain information
Probab=99.55  E-value=1.2e-15  Score=113.64  Aligned_cols=38  Identities=58%  Similarity=0.915  Sum_probs=37.3

Q ss_pred             HHHHHHHHHHHhhccCCCCcccccccccccccCCCCCC
Q 014844          380 REASVLRYKEKRRTRLFSKKIRYQVRKVNADRRPRMKV  417 (417)
Q Consensus       380 Rea~v~RYkEKRk~R~F~KkIRY~~RK~~Ad~RPRvKG  417 (417)
                      |+++|+||+|||++|+|+|+|||+|||.+||.||||||
T Consensus         1 R~~~l~Ry~~Kr~~R~f~kkirY~~Rk~~A~~R~RvkG   38 (45)
T PF06203_consen    1 REEKLQRYREKRKRRNFEKKIRYESRKAVADKRPRVKG   38 (45)
T ss_pred             CHHHHHHHHHHHHhhcccccCCcchHHHHHhhCCccCC
Confidence            78999999999999999999999999999999999998



The CCT domain is found in association with other domains, such as the B-box zinc finger, the GATA-type zinc finger, the ZIM motif or the response regulatory domain. The CCT domain contains a putative nuclear localisation signal within the second half of the CCT motif and has been shown to be involved in nuclear localization and probably also has a role in protein-protein interaction [].; GO: 0005515 protein binding

>KOG1601 consensus GATA-4/5/6 transcription factors [Transcription] Back     alignment and domain information
>PF09425 CCT_2: Divergent CCT motif; InterPro: IPR018467 The short CCT (CO, COL, TOC1) motif is found in a number of plant proteins, including Constans (CO), Constans-like (COL) and TOC1 Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query417
1vt4_I 1221 APAF-1 related killer DARK; drosophila apoptosome, 2e-04
>1vt4_I APAF-1 related killer DARK; drosophila apoptosome, apoptosis, programmed cell death; HET: DTP; 6.90A {Drosophila melanogaster} PDB: 3iz8_A* Length = 1221 Back     alignment and structure
 Score = 42.9 bits (100), Expect = 2e-04
 Identities = 53/368 (14%), Positives = 96/368 (26%), Gaps = 100/368 (27%)

Query: 98  HRRHHQQQEPNQQFFDESSELLLPFR--VIDDSGFLLHQPILDKPRFPLEPKVTKFCENK 155
           H  HH   E  +  + +  ++L  F    +D+      + + D P+  L  +        
Sbjct: 2   HHHHHMDFETGEHQY-QYKDILSVFEDAFVDNFDC---KDVQDMPKSILSKEEIDHIIMS 57

Query: 156 --------------LSCQNSGEVDFQGANHNPMEVQISSEFDEDFDAESIILDEEFEEGI 201
                         LS Q      F           + S    +    S++    + E  
Sbjct: 58  KDAVSGTLRLFWTLLSKQEEMVQKFVEEVLRINYKFLMSPIKTEQRQPSMMT-RMYIEQR 116

Query: 202 DSIM-GNLSVKKDAIESDDGHSQVHN------SNCW---YGNPMGIGFGGKLELGFGMRS 251
           D +   N    K  +     + ++                G  +  G  GK  +   +  
Sbjct: 117 DRLYNDNQVFAKYNVSRLQPYLKLRQALLELRPAKNVLIDG--VL-GS-GKTWVALDVCL 172

Query: 252 GMKALRQVDEGNWW-NF-----PIDVL--------QISPKFNN--SSGTGNSNRKGSSEK 295
             K   ++D   +W N      P  VL        QI P + +     +    R  S + 
Sbjct: 173 SYKVQCKMDFKIFWLNLKNCNSPETVLEMLQKLLYQIDPNWTSRSDHSSNIKLRIHSIQA 232

Query: 296 RKKKVGKEKPVAESKVSELPAKEESPIPKANSGELLLKLQYDDVLNAWSGK----GSPFL 351
             +++ K KP                         LL L   +V NA +          L
Sbjct: 233 ELRRLLKSKPYENC---------------------LLVL--LNVQNAKAWNAFNLSCKIL 269

Query: 352 ----DESLAPDGQGNDVSARLAQIDLFSDGGLREASVLRYKEKRRTRLFSK-------KI 400
                + +  D      +  ++         L    V          L  K        +
Sbjct: 270 LTTRFKQVT-DFLSAATTTHISLDHH--SMTLTPDEVK--------SLLLKYLDCRPQDL 318

Query: 401 RYQVRKVN 408
             +V   N
Sbjct: 319 PREVLTTN 326


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query417
3ogl_Q21 JAZ1 incomplete degron peptide; leucine-rich repea 90.79
3ogk_Q22 JAZ1 incomplete degron peptide; leucine rich repea 90.35
>3ogk_Q JAZ1 incomplete degron peptide; leucine rich repeat, ubiquitin ligase, SCF, protein binding; HET: OGK; 2.80A {Arabidopsis thaliana} Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

No hit with probability above 80.00