Citrus Sinensis ID: 041521


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------270-------280-------290-------300-------310-------320-------330-------340-------350-------360-------370-------380-------390-------400-------410-------420-------430-------440-------450-------460-------470-------480-------490-------500-------510-------520----
MEITRFPFLNQEEDYSHLLDLLPEMESRSTFINNNNSSNNNNNNNNNLKKRRRSDDVLNKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNKRSRRTASAVATVAASASASASASEDASADNPTTAGGSAQHRRLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFLH
cccccccccccccccHHHHHHccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHcccHHHHHHHHHHHHccccccHHHccccHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccHHHHHHHccccHHHHHHHHHHHHcccccccccccccccHHHHHHHHHHHHcccccHHHHHHHHcccHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHcccccccEEEEEccEEEEEcccccccccccccccccccccccccEEEEEEcccccEEEEEcccccccccHHHHHccHHHHHHHccccccEEEEEccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccHHHHHHHHHHHHHHHccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHcccccccccc
cccccccccccccHHHHHHHHHHHccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccEEEEEccccHHHHHHHccccccHHHHHHHHcccHHHHHHHHHHHHHHHHcccccccccccHHHHHHHHHHHHHccccHHHHHHHccccHHHHHHHHHHHHHHHHHHHHHHHcccccHHHHHHHHHHHcccccccccEEEEEccEEEEEcccHHHHHHcccccEEccccccEEEEEEEEEccccEEEEEEEcccccccHHHHHHHcHHHHHHccccccccEEEEcccccccccEEcccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccHHHHHHHHHHHHHHHHHHHHcccccccHHcccccccccccccccccHHHHHHHHHHHHHHHHcccccccccc
meitrfpflnqeeDYSHLLDllpemesrstfinnnnssnnnnnnnnnlkkrrrsddvlnksaAWSDILTSLILLDEEEKREQQQYSIHshqdkllvddnHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNKRSRRTASAVATVAASASasasasedasadnpttaggsaqhrrLWVKDrskdwwdernhpdfpeeefWRDFRMSKATFEMICEELESTVMKKNtmlrdaipVRQRVAVCVWRlatgeplrVVSKRFGLGISTCHKLVLEVCSAIKTVlmpkflqwpdelKMKQIKEEFqgisgipnvggsmytthipiiapkisvASYFNKrhternqktsySITVQGVvdtkgvftdvcigwpgsmpddqvLERSALFQRAdrgllkdvwivgnsgyplmdwvmvpytqknltwTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKrtevklqdlpvvLGACCVLHNICEmrnevmdpqlkfdlfddemipdnsvrSMASAQARDHIAHNLlhhglagtsflh
meitrfpflnqeeDYSHLLDLLPEMESRSTFinnnnssnnnnnnnnnlkkRRRSDDVLNKSAAWSDILTSLILLDEEEKREQQQYsihshqdkllvDDNHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNKRSRRTASAVATVAASAsasasasedasadnpttaggsaqhrrlwvkdrskdwwdernhpdfpeeefwRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRlatgeplrvvSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHternqktsysitvqgvvdTKGVFTDVCIGWPGSMPDDQVLERSALFQradrgllkdvWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKrtevklqdlpvVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGlagtsflh
MEITRFPFLNQEEDYSHLLDLLPEMESRSTFInnnnssnnnnnnnnnLKKRRRSDDVLNKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNKRsrrtasavatvaasasasasasedasadNPTTAGGSAQHRRLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFLH
*************************************************************AAWSDILTSLILL***********************************YFHQLQDHYTDLDV**************************************************LWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDE******************IAHNLLHHG********
****RFPFLNQEEDYSHLLDLLPEM****************************************DILTSLILLD**********************DNHKRKEQAMNDYFHQL************************************************************WVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNT**RDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMI**********AQARDHIAHNLLHHGL*****L*
MEITRFPFLNQEEDYSHLLDLLPEMESRSTFINNNNSSNNNNNNNNNLKKRRRSDDVLNKSAAWSDILTSLILLDEE***********SHQDKLLVDDNHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNK*****************************************RLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFLH
****RFPFLNQEEDYSHLLDLLPEMESR*******************************KSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNKR*************************************QHRRLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDD**IPDNSVRSMASAQARDHIAHNLLHHGLAGTSFL*
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MEITRFPFLNQEEDYSHLLDLLPEMESRSTFINNNNSSNNNNNNNNNLKKRRRSDDVLNKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHYTDLDVMDQLRTNKRSRRTASAVATVAASASASASASEDASADNPTTAGGSAQHRRLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFLH
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query524 2.2.26 [Sep-21-2011]
Q96MB7349 Putative nuclease HARBI1 yes no 0.343 0.515 0.324 3e-21
Q17QR8349 Putative nuclease HARBI1 yes no 0.343 0.515 0.324 9e-21
Q8BR93349 Putative nuclease HARBI1 yes no 0.343 0.515 0.319 1e-20
B0BN95349 Putative nuclease HARBI1 yes no 0.343 0.515 0.314 3e-20
Q6AZB8349 Putative nuclease HARBI1 no no 0.488 0.733 0.277 5e-19
Q5U538347 Putative nuclease HARBI1 N/A no 0.341 0.515 0.287 2e-11
>sp|Q96MB7|HARB1_HUMAN Putative nuclease HARBI1 OS=Homo sapiens GN=HARBI1 PE=1 SV=1 Back     alignment and function desciption
 Score =  103 bits (258), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 63/194 (32%), Positives = 102/194 (52%), Gaps = 14/194 (7%)

Query: 281 KFLQWP-DELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQK 339
           +F+++P DE  ++ +K+EF G++G+P V G +   H+ I AP     SY N+       K
Sbjct: 117 QFIRFPADEASIQALKDEFYGLAGMPGVMGVVDCIHVAIKAPNAEDLSYVNR-------K 169

Query: 340 TSYSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYP 399
             +S+    V D +G    V   WPGS+ D  VL++S+L  + + G+ KD W++G+S + 
Sbjct: 170 GLHSLNCLMVCDIRGTLMTVETNWPGSLQDCAVLQQSSLSSQFEAGMHKDSWLLGDSSFF 229

Query: 400 LMDWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPV- 458
           L  W+M P      T  ++ +N       +V +  F  L  R+ CL   ++  LQ  P  
Sbjct: 230 LRTWLMTPLHIPE-TPAEYRYNMAHSATHSVIEKTFRTLCSRFRCLDG-SKGALQYSPEK 287

Query: 459 ---VLGACCVLHNI 469
              ++ ACCVLHNI
Sbjct: 288 SSHIILACCVLHNI 301




Transposase-derived protein that may have nuclease activity (Potential). Does not have transposase activity.
Homo sapiens (taxid: 9606)
EC: 3EC: .EC: 1EC: .EC: -EC: .EC: -
>sp|Q17QR8|HARB1_BOVIN Putative nuclease HARBI1 OS=Bos taurus GN=HARBI1 PE=2 SV=1 Back     alignment and function description
>sp|Q8BR93|HARB1_MOUSE Putative nuclease HARBI1 OS=Mus musculus GN=Harbi1 PE=2 SV=1 Back     alignment and function description
>sp|B0BN95|HARB1_RAT Putative nuclease HARBI1 OS=Rattus norvegicus GN=Harbi1 PE=2 SV=1 Back     alignment and function description
>sp|Q6AZB8|HARB1_DANRE Putative nuclease HARBI1 OS=Danio rerio GN=harbi1 PE=2 SV=1 Back     alignment and function description
>sp|Q5U538|HARB1_XENLA Putative nuclease HARBI1 OS=Xenopus laevis GN=harbi1 PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query524
255546077541 conserved hypothetical protein [Ricinus 0.996 0.964 0.688 0.0
224114577456 predicted protein [Populus trichocarpa] 0.868 0.997 0.755 0.0
224085922453 predicted protein [Populus trichocarpa] 0.862 0.997 0.748 0.0
225429570484 PREDICTED: uncharacterized protein LOC10 0.906 0.981 0.677 0.0
224092160507 predicted protein [Populus trichocarpa] 0.904 0.934 0.660 0.0
356532740527 PREDICTED: putative nuclease HARBI1-like 0.990 0.984 0.638 0.0
356552466534 PREDICTED: uncharacterized protein LOC10 0.862 0.846 0.687 0.0
343887274515 hypothetical protein [Citrus unshiu] 0.868 0.883 0.691 0.0
357437265 1056 hypothetical protein MTR_1g015070 [Medic 0.868 0.430 0.683 0.0
356563940536 PREDICTED: putative nuclease HARBI1-like 0.866 0.847 0.687 0.0
>gi|255546077|ref|XP_002514098.1| conserved hypothetical protein [Ricinus communis] gi|223546554|gb|EEF48052.1| conserved hypothetical protein [Ricinus communis] Back     alignment and taxonomy information
 Score =  731 bits (1886), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/542 (68%), Positives = 438/542 (80%), Gaps = 20/542 (3%)

Query: 1   MEITRFPFLNQEEDYSHLLDLLPEMESRSTFINNNNSSNNNNNN--NNNLKKRRR----- 53
           MEI+ FP LNQ+ D+SH   LL +M+S +   NN N S+  ++   NN++KKR+R     
Sbjct: 1   MEISSFPLLNQD-DFSHFYSLLQDMDSSNADTNNTNGSSAASSFGLNNDIKKRKRKSDNE 59

Query: 54  -----------SDDVLNKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKR 102
                      ++D L KS    +IL SL++LDEEEK+EQQQ+ I S QDK L D N KR
Sbjct: 60  DNDPFLLQLPNTEDDLKKSTTMRNILASLLMLDEEEKQEQQQWLIDSQQDKSLFDSNSKR 119

Query: 103 KEQAMNDYFHQLQDHYTDLDVMDQLRTNKRSRRTASAVATVAASASASASASE-DASADN 161
           K QAM+ Y   LQ+H+TDL  +D  R  K  R  +++ A  AA A AS + +  D++  N
Sbjct: 120 KIQAMDAYSSHLQNHFTDLHELDHSRIKKARRSASASAAAAAAVAVASNNVASADSAQSN 179

Query: 162 PTTAGGSAQHRRLWVKDRSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMK 221
              A GS QHRRLWVKDRSKDWW++ +HPDFPEEEF + FRMSKATFE IC EL+S V K
Sbjct: 180 GGGASGSGQHRRLWVKDRSKDWWEKCSHPDFPEEEFKKAFRMSKATFETICNELDSAVTK 239

Query: 222 KNTMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPK 281
           KNTMLRDAIPVRQRVAVC+WRLATGEPLR+VSKRFGLGISTCHKLVLEVCSAI+ VLMP+
Sbjct: 240 KNTMLRDAIPVRQRVAVCIWRLATGEPLRLVSKRFGLGISTCHKLVLEVCSAIRNVLMPR 299

Query: 282 FLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTS 341
           FLQWPDE K++ IK E++ +SGIPNVGGSMYTTHIPIIAPKISVA+YFNKRHTERNQKTS
Sbjct: 300 FLQWPDEEKLRMIKSEYEAVSGIPNVGGSMYTTHIPIIAPKISVAAYFNKRHTERNQKTS 359

Query: 342 YSITVQGVVDTKGVFTDVCIGWPGSMPDDQVLERSALFQRADRGLLKDVWIVGNSGYPLM 401
           YS+TVQGVVD +GVFTDVCIGWPGSM DDQVLE+SALFQRA+RG LKDVWIVGNSGYPLM
Sbjct: 360 YSMTVQGVVDPRGVFTDVCIGWPGSMTDDQVLEKSALFQRANRGALKDVWIVGNSGYPLM 419

Query: 402 DWVMVPYTQKNLTWTQHAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLG 461
           DWV+VPYT +NLTWTQHAFNEKIG++Q V K+AFARLK RW+CLQKRTEVKLQDLPVVLG
Sbjct: 420 DWVLVPYTHQNLTWTQHAFNEKIGEVQRVGKEAFARLKARWSCLQKRTEVKLQDLPVVLG 479

Query: 462 ACCVLHNICEMRNEVMDPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTS 521
           ACCVLHNICEMRNE ++P  ++++FDDEMIP+N +RS+ + QARDHIAHNLLHHGLAGTS
Sbjct: 480 ACCVLHNICEMRNEEIEPGHRYEIFDDEMIPENGLRSVGAIQARDHIAHNLLHHGLAGTS 539

Query: 522 FL 523
           FL
Sbjct: 540 FL 541




Source: Ricinus communis

Species: Ricinus communis

Genus: Ricinus

Family: Euphorbiaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|224114577|ref|XP_002332342.1| predicted protein [Populus trichocarpa] gi|222831909|gb|EEE70386.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|224085922|ref|XP_002307739.1| predicted protein [Populus trichocarpa] gi|222857188|gb|EEE94735.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|225429570|ref|XP_002280033.1| PREDICTED: uncharacterized protein LOC100262760 [Vitis vinifera] gi|296081670|emb|CBI20675.3| unnamed protein product [Vitis vinifera] Back     alignment and taxonomy information
>gi|224092160|ref|XP_002309488.1| predicted protein [Populus trichocarpa] gi|222855464|gb|EEE93011.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
>gi|356532740|ref|XP_003534929.1| PREDICTED: putative nuclease HARBI1-like [Glycine max] Back     alignment and taxonomy information
>gi|356552466|ref|XP_003544588.1| PREDICTED: uncharacterized protein LOC100804219 [Glycine max] Back     alignment and taxonomy information
>gi|343887274|dbj|BAK61820.1| hypothetical protein [Citrus unshiu] Back     alignment and taxonomy information
>gi|357437265|ref|XP_003588908.1| hypothetical protein MTR_1g015070 [Medicago truncatula] gi|355477956|gb|AES59159.1| hypothetical protein MTR_1g015070 [Medicago truncatula] Back     alignment and taxonomy information
>gi|356563940|ref|XP_003550215.1| PREDICTED: putative nuclease HARBI1-like [Glycine max] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query524
TAIR|locus:2143104502 AT5G12010 [Arabidopsis thalian 0.885 0.924 0.609 3.2e-162
TAIR|locus:2123874540 AT4G29780 "AT4G29780" [Arabido 0.994 0.964 0.562 1.4e-160
ZFIN|ZDB-GENE-050327-32415 zgc:113227 "zgc:113227" [Danio 0.553 0.698 0.366 2.7e-52
TAIR|locus:2099901406 AT3G55350 [Arabidopsis thalian 0.561 0.724 0.307 9.2e-36
TAIR|locus:2077259396 AT3G63270 [Arabidopsis thalian 0.534 0.707 0.294 4.7e-32
UNIPROTKB|E1BQ99348 HARBI1 "Uncharacterized protei 0.511 0.770 0.275 1.7e-22
UNIPROTKB|E2RCW9349 HARBI1 "Uncharacterized protei 0.469 0.704 0.283 1e-21
UNIPROTKB|Q96MB7349 HARBI1 "Putative nuclease HARB 0.469 0.704 0.283 1.3e-21
UNIPROTKB|Q17QR8349 HARBI1 "Putative nuclease HARB 0.469 0.704 0.279 1.5e-20
UNIPROTKB|F1SIA2349 HARBI1 "Uncharacterized protei 0.469 0.704 0.275 1.5e-20
TAIR|locus:2143104 AT5G12010 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 1563 (555.3 bits), Expect = 3.2e-162, Sum P(2) = 3.2e-162
 Identities = 284/466 (60%), Positives = 357/466 (76%)

Query:    59 NKSAAWSDILTSLILLDEEEKREQQQYSIHSHQDKLLVDDNHKRKEQAMNDYFHQLQDHY 118
             N++       TSL+L++E EK++Q+  +  S ++      N++++ + M+DY+  L D+Y
Sbjct:    38 NETKNLKGFFTSLLLMEEHEKQDQEARNAASRREMSDFQSNYRKRARTMSDYYSDLNDYY 97

Query:   119 TDLDVMDQLRTNKRXXXXXXXXXXXXXXXXXXXXXXXXXXXXNPTTAGGSAQHRRLWVKD 178
              D +    +   K+                                  GS Q RRLWVKD
Sbjct:    98 ADAEESGDINL-KKSRVSRAVASVAVAAASEIEAESSEITGSGSVRGTGSGQQRRLWVKD 156

Query:   179 RSKDWWDERNHPDFPEEEFWRDFRMSKATFEMICEELESTVMKKNTMLRDAIPVRQRVAV 238
             RS+ WW+E +  D+PEE+F + FRMSK+TFE+IC+EL S V K++T LR+AIPVRQRVAV
Sbjct:   157 RSRAWWEECSRLDYPEEDFKKAFRMSKSTFELICDELNSAVAKEDTALRNAIPVRQRVAV 216

Query:   239 CVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTVLMPKFLQWPDELKMKQIKEEF 298
             C+WRLATGEPLR+VSK+FGLGISTCHKLVLEVC AIK VLMPK+LQWPD+  ++ I+E F
Sbjct:   217 CIWRLATGEPLRLVSKKFGLGISTCHKLVLEVCKAIKDVLMPKYLQWPDDESLRNIRERF 276

Query:   299 QGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTD 358
             + +SGIPNV GSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSIT+Q VV+ KGVFTD
Sbjct:   277 ESVSGIPNVVGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITIQAVVNPKGVFTD 336

Query:   359 VCIGWPGSMPDDQVLERSALFQRADRG-LLKDVWIVGNSGYPLMDWVMVPYTQKNLTWTQ 417
             +CIGWPGSMPDD+VLE+S L+QRA+ G LLK +W+ G  G+PL+DWV+VPYTQ+NLTWTQ
Sbjct:   337 LCIGWPGSMPDDKVLEKSLLYQRANNGGLLKGMWVAGGPGHPLLDWVLVPYTQQNLTWTQ 396

Query:   418 HAFNEKIGDIQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVM 477
             HAFNEK+ ++Q VAK+AF RLKGRWACLQKRTEVKLQDLP VLGACCVLHNICEMR E M
Sbjct:   397 HAFNEKMSEVQGVAKEAFGRLKGRWACLQKRTEVKLQDLPTVLGACCVLHNICEMREEKM 456

Query:   478 DPQLKFDLFDDEMIPDNSVRSMASAQARDHIAHNLLHHGLAGTSFL 523
             +P+L  ++ DDE++P+N +RS+ + +ARD I+HNLLHHGLAGTSFL
Sbjct:   457 EPELMVEVIDDEVLPENVLRSVNAMKARDTISHNLLHHGLAGTSFL 502


GO:0005634 "nucleus" evidence=ISM
GO:0005886 "plasma membrane" evidence=IDA
GO:0005774 "vacuolar membrane" evidence=IDA
GO:0016020 "membrane" evidence=IDA
GO:0009507 "chloroplast" evidence=IDA
GO:0015824 "proline transport" evidence=RCA
TAIR|locus:2123874 AT4G29780 "AT4G29780" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
ZFIN|ZDB-GENE-050327-32 zgc:113227 "zgc:113227" [Danio rerio (taxid:7955)] Back     alignment and assigned GO terms
TAIR|locus:2099901 AT3G55350 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2077259 AT3G63270 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|E1BQ99 HARBI1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|E2RCW9 HARBI1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|Q96MB7 HARBI1 "Putative nuclease HARBI1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q17QR8 HARBI1 "Putative nuclease HARBI1" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|F1SIA2 HARBI1 "Uncharacterized protein" [Sus scrofa (taxid:9823)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
gw1.207.4.1
hypothetical protein (456 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query524
pfam13359155 pfam13359, DDE_4, DDE superfamily endonuclease 1e-40
>gnl|CDD|222071 pfam13359, DDE_4, DDE superfamily endonuclease Back     alignment and domain information
 Score =  143 bits (362), Expect = 1e-40
 Identities = 48/166 (28%), Positives = 75/166 (45%), Gaps = 19/166 (11%)

Query: 311 MYTTHIPIIAPK---ISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFTDVCIGWPGSM 367
           +  T IPI  PK       +Y   +          ++ V  VVD  G    V +GWPGS+
Sbjct: 1   IDGTEIPIERPKSLEEQAQTYSGYKKH-------NTLKVLIVVDPDGRIIFVSVGWPGSV 53

Query: 368 PDDQVLERSALFQRADRGLLKDVWIVGNSGYPLMDWVMVP----YTQKNLTWTQHAFNEK 423
            D ++LE S L ++   G     +++ + G+PL D ++ P         LT  +  FN +
Sbjct: 54  SDKRILENSGLLEKLPPGD----YVLADRGFPLSDSLLAPPAKKPGGAQLTEEEVEFNRR 109

Query: 424 IGDIQAVAKDAFARLKGRWACLQKRTEVK-LQDLPVVLGACCVLHN 468
           I   +   +    RLKGR+  L+ R  +  L  L  ++  CC LHN
Sbjct: 110 IASARIHVERVIGRLKGRFRILRGRLRISKLTTLDKIVLVCCALHN 155


This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which contain three carboxylate residues that are believed to be responsible for coordinating metal ions needed for catalysis. The catalytic activity of this enzyme involves DNA cleavage at a specific site followed by a strand transfer reaction. Length = 155

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 524
KOG4585326 consensus Predicted transposase [Replication, reco 100.0
PF13359158 DDE_Tnp_4: DDE superfamily endonuclease 100.0
PF04827205 Plant_tran: Plant transposon protein; InterPro: IP 99.96
PF13612155 DDE_Tnp_1_3: Transposase DDE domain 98.74
PF1361353 HTH_Tnp_4: Helix-turn-helix of DDE superfamily end 98.67
PF01609213 DDE_Tnp_1: Transposase DDE domain; InterPro: IPR00 97.54
PF1358688 DDE_Tnp_1_2: Transposase DDE domain 95.76
PF0421853 CENP-B_N: CENP-B N-terminal DNA-binding domain; In 93.86
PF0454550 Sigma70_r4: Sigma-70, region 4; InterPro: IPR00763 92.96
PF0279645 HTH_7: Helix-turn-helix domain of resolvase; Inter 92.79
PF1393644 HTH_38: Helix-turn-helix domain; PDB: 2W48_A. 92.73
PF1351852 HTH_28: Helix-turn-helix domain 88.96
PF1338450 HTH_23: Homeodomain-like domain; PDB: 2X48_C. 88.75
PF1211682 SpoIIID: Stage III sporulation protein D; InterPro 88.56
PF0828154 Sigma70_r4_2: Sigma-70, region 4; InterPro: IPR013 87.49
smart00351125 PAX Paired Box domain. 86.95
PF1334075 DUF4096: Putative transposase of IS4/5 family (DUF 86.71
PF0220936 VHP: Villin headpiece domain; InterPro: IPR003128 86.33
PF1301185 LZ_Tnp_IS481: leucine-zipper of insertion element 85.51
PF00872381 Transposase_mut: Transposase, Mutator family; Inte 84.55
cd0056942 HTH_Hin_like Helix-turn-helix domain of Hin and re 84.48
cd00131128 PAX Paired Box domain 84.34
smart0015336 VHP Villin headpiece domain. 83.85
smart0042158 HTH_LUXR helix_turn_helix, Lux Regulon. lux regulo 83.67
COG3415138 Transposase and inactivated derivatives [DNA repli 81.96
PRK09413121 IS2 repressor TnpA; Reviewed 81.61
PF0522545 HTH_psq: helix-turn-helix, Psq domain; InterPro: I 81.38
PRK04217110 hypothetical protein; Provisional 81.18
PRK09639166 RNA polymerase sigma factor SigX; Provisional 80.33
cd0617155 Sigma70_r4 Sigma70, region (SR) 4 refers to the mo 80.11
>KOG4585 consensus Predicted transposase [Replication, recombination and repair] Back     alignment and domain information
Probab=100.00  E-value=1.5e-39  Score=334.59  Aligned_cols=297  Identities=38%  Similarity=0.616  Sum_probs=238.4

Q ss_pred             HcCCCHHHHHHHHHHhhccccccC--CCCCCCCChhhhhhhhhhccccCCCccchhhccCccccceehhhhhhHHHHHhh
Q 041521          200 DFRMSKATFEMICEELESTVMKKN--TMLRDAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLEVCSAIKTV  277 (524)
Q Consensus       200 ~fRmSr~tF~~Lv~~L~~~i~~~~--t~~r~~l~~e~rL~i~L~~LatG~s~~~la~~FgiS~STvsriv~~v~~aI~~~  277 (524)
                      .|++++.+|+.++...........  ......+++..+++++|+.++++.+.+.++..||...+|+      ++......
T Consensus         7 ~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~i~~~fg~~~~~~------~~~~~~~~   80 (326)
T KOG4585|consen    7 EFRKSYTTFDKICSLVQSLNVVKNSGFMLSSLLPADTLVAVALWRLKTGESLRTVEKKFGLGQSTC------KFLEEKED   80 (326)
T ss_pred             HHHHHHHHHHHHhhhhhhhhhhcccchhhhccccHHhhhhhhhccccccchHHHHHHHcCCcchhh------hHHHhhhc
Confidence            788999999999998765433222  1222333388899999999999999999999999999998      34444445


Q ss_pred             cccccccCcchHHHHHHHhhhccccCCCccccCCcceeeeEecCCCccccccccccccccCCCCccccccccccCCCcee
Q 041521          278 LMPKFLQWPDELKMKQIKEEFQGISGIPNVGGSMYTTHIPIIAPKISVASYFNKRHTERNQKTSYSITVQGVVDTKGVFT  357 (524)
Q Consensus       278 L~~~~I~~P~~ee~~~i~~~F~~~~gfP~~vGaIDgThIpI~~P~~~~~~y~n~~~~~rn~K~~~Si~vq~v~D~~g~fi  357 (524)
                      +++.++.||....+..+.+.|+.   +|+|+|+||+||+++..|....+.|+|+         .++.++|+|||++++|+
T Consensus        81 ~~~~~~~~p~~~~~~~i~~~~~~---~~~~~g~~d~~hi~~~~~~~~~~~~~n~---------~~~~Nvlav~n~d~~f~  148 (326)
T KOG4585|consen   81 LAPHFLKWPSRRILYEIRERFES---LPNCVGAIDTTHIPIRVPPKSGSVYFNK---------EQSKNLLAVCNFDMRFI  148 (326)
T ss_pred             ccchhhcCchhhhhhhhcccccc---ccchhccccccccceecCcccccccccc---------ccchhhhheecCCceEE
Confidence            68999999998888889899877   9999999999999999999988888763         67888999999999999


Q ss_pred             eecCCCCCCCCChHHHHhHHHHhhhhccccc-----------eEEEEcccCCCCccccccccccCCcccchhhccccccc
Q 041521          358 DVCIGWPGSMPDDQVLERSALFQRADRGLLK-----------DVWIVGNSGYPLMDWVMVPYTQKNLTWTQHAFNEKIGD  426 (524)
Q Consensus       358 ~v~~g~pGS~hDs~V~~~S~L~~~l~~~~~~-----------g~~lLGD~gYpl~~~LmtPy~~~~lt~~q~~FN~~ls~  426 (524)
                      +|++||||+.||+.|++.+.+......+.+.           ..+++|+.+||+.+++|+||.++.++..++.||.+|+.
T Consensus       149 ~v~vg~~Gs~~D~kvl~~~~~~~~~~~~~~~k~yl~d~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~elFN~rh~~  228 (326)
T KOG4585|consen  149 YVDVGWPGSAHDTKVLQDSLLYKRNFPHPPLKYYLVDSGYPLRPGLLGPIGFPLYSLLMFPYGGPQPTNSQELFNKRHSS  228 (326)
T ss_pred             EEEccCCCCccHHHHHHhhcccccccccCCccccccccCcccccccccccccccchhhhcccCCCCCCchHHHHhhhhhh
Confidence            9999999999999999999988766544442           45666667777777788888887778899999999999


Q ss_pred             chhhhhhcccccccccccccccccccCCCccceehhhHHHHhhhhhcCCCCCCCCCcCCCCCCC-----CCCCcccChhH
Q 041521          427 IQAVAKDAFARLKGRWACLQKRTEVKLQDLPVVLGACCVLHNICEMRNEVMDPQLKFDLFDDEM-----IPDNSVRSMAS  501 (524)
Q Consensus       427 ~R~~VE~aFG~LK~RFriL~~~~~~~~~~~~~iI~AccvLHNi~~~~~~~~~~~~~~~~~dd~~-----~p~~~~~~~~~  501 (524)
                      +|.++|++||+||+||+||+........+.+.||.|||+|||||++..+..+++...+..++..     .........-+
T Consensus       229 ~r~v~e~~fg~lk~rw~il~~~~~~~~~~~~~iV~a~caLHN~~~~~~~~~~~~~~~e~~~d~~~~~~~~~~~~~~~~~~  308 (326)
T KOG4585|consen  229 LRSVAERAFGVLKAKWRILQRREKYDLKKLPKIVTACCALHNIIRDSDEEDPDDPKWEKFDDYGENVAHLRYAPQQRDYM  308 (326)
T ss_pred             HHHHHHHHHHHhhhhhHHHhhcccccccchHHHHHHHHHHHHHHHhhcccccccccccccccccccchhcccchhHHHHH
Confidence            9999999999999999999998667788999999999999999999877543322222223322     11111223346


Q ss_pred             HHHHHHHHHHHhh
Q 041521          502 AQARDHIAHNLLH  514 (524)
Q Consensus       502 ~~~Rd~ia~~l~~  514 (524)
                      ..+||.|+..||.
T Consensus       309 ~~~r~~l~~~l~~  321 (326)
T KOG4585|consen  309 EKIRDNLLSELWN  321 (326)
T ss_pred             HHHHHHHHHHHHh
Confidence            7788999988864



>PF13359 DDE_Tnp_4: DDE superfamily endonuclease Back     alignment and domain information
>PF04827 Plant_tran: Plant transposon protein; InterPro: IPR006912 This entry represents a putative Harbinger transposase-derived nuclease, which is thought to have nuclease activity Back     alignment and domain information
>PF13612 DDE_Tnp_1_3: Transposase DDE domain Back     alignment and domain information
>PF13613 HTH_Tnp_4: Helix-turn-helix of DDE superfamily endonuclease Back     alignment and domain information
>PF01609 DDE_Tnp_1: Transposase DDE domain; InterPro: IPR002559 Autonomous mobile genetic elements such as transposon or insertion sequences (IS) encode an enzyme, transposase, that is required for excising and inserting the mobile element Back     alignment and domain information
>PF13586 DDE_Tnp_1_2: Transposase DDE domain Back     alignment and domain information
>PF04218 CENP-B_N: CENP-B N-terminal DNA-binding domain; InterPro: IPR006695 Centromere Protein B (CENP-B) is a DNA-binding protein localized to the centromere Back     alignment and domain information
>PF04545 Sigma70_r4: Sigma-70, region 4; InterPro: IPR007630 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription Back     alignment and domain information
>PF02796 HTH_7: Helix-turn-helix domain of resolvase; InterPro: IPR006120 Site-specific recombination plays an important role in DNA rearrangement in prokaryotic organisms Back     alignment and domain information
>PF13936 HTH_38: Helix-turn-helix domain; PDB: 2W48_A Back     alignment and domain information
>PF13518 HTH_28: Helix-turn-helix domain Back     alignment and domain information
>PF13384 HTH_23: Homeodomain-like domain; PDB: 2X48_C Back     alignment and domain information
>PF12116 SpoIIID: Stage III sporulation protein D; InterPro: IPR014208 Members of this entry represent the transcriptional regulator SpoIIID, or stage III sporulation protein D Back     alignment and domain information
>PF08281 Sigma70_r4_2: Sigma-70, region 4; InterPro: IPR013249 The bacterial core RNA polymerase complex, which consists of five subunits, is sufficient for transcription elongation and termination but is unable to initiate transcription Back     alignment and domain information
>smart00351 PAX Paired Box domain Back     alignment and domain information
>PF13340 DUF4096: Putative transposase of IS4/5 family (DUF4096) Back     alignment and domain information
>PF02209 VHP: Villin headpiece domain; InterPro: IPR003128 Villin is an F-actin bundling protein involved in the maintenance of the microvilli of the absorptive epithelia Back     alignment and domain information
>PF13011 LZ_Tnp_IS481: leucine-zipper of insertion element IS481 Back     alignment and domain information
>PF00872 Transposase_mut: Transposase, Mutator family; InterPro: IPR001207 Autonomous mobile genetic elements such as transposon or insertion sequences (IS) encode an enzyme, transposase, that is required for excising and inserting the mobile element Back     alignment and domain information
>cd00569 HTH_Hin_like Helix-turn-helix domain of Hin and related proteins, a family of DNA-binding domains unique to bacteria and represented by the Hin protein of Salmonella Back     alignment and domain information
>cd00131 PAX Paired Box domain Back     alignment and domain information
>smart00153 VHP Villin headpiece domain Back     alignment and domain information
>smart00421 HTH_LUXR helix_turn_helix, Lux Regulon Back     alignment and domain information
>COG3415 Transposase and inactivated derivatives [DNA replication, recombination, and repair] Back     alignment and domain information
>PRK09413 IS2 repressor TnpA; Reviewed Back     alignment and domain information
>PF05225 HTH_psq: helix-turn-helix, Psq domain; InterPro: IPR007889 This DNA-binding motif is found in four copies in the pipsqueak protein of Drosophila melanogaster [] Back     alignment and domain information
>PRK04217 hypothetical protein; Provisional Back     alignment and domain information
>PRK09639 RNA polymerase sigma factor SigX; Provisional Back     alignment and domain information
>cd06171 Sigma70_r4 Sigma70, region (SR) 4 refers to the most C-terminal of four conserved domains found in Escherichia coli (Ec) sigma70, the main housekeeping sigma, and related sigma-factors (SFs) Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query524
1tc3_C51 Protein (TC3 transposase); DNA binding, helix-turn 93.14
2p7v_B68 Sigma-70, RNA polymerase sigma factor RPOD; RSD, r 91.43
2glo_A59 Brinker CG9653-PA; protein-DNA complex, helix-turn 90.49
2w7n_A101 TRFB transcriptional repressor protein; INCP, plas 90.17
2jn6_A97 Protein CGL2762, transposase; GFT PSI-2, protein s 90.16
2o8x_A70 Probable RNA polymerase sigma-C factor; promoter r 89.71
1ku3_A73 Sigma factor SIGA; helix-turn-helix, transcription 89.21
1jko_C52 HIN recombinase, DNA-invertase HIN; water-mediated 88.7
1tty_A87 Sigma-A, RNA polymerase sigma factor RPOD; helix-t 88.57
1wy3_A35 Villin; structural protein; HET: NLE; 0.95A {Synth 86.97
3hug_A92 RNA polymerase sigma factor; ECF sigma factor, zin 86.35
1und_A37 Advillin, P92; actin binding, F-actin binding, cyt 85.6
1jhg_A101 Trp operon repressor; complex (regulatory protein- 85.47
3t72_q99 RNA polymerase sigma factor RPOD, DNA-directed RN 84.1
1u78_A141 TC3 transposase, transposable element TC3 transpos 82.01
2elh_A87 CG11849-PA, LD40883P; structural genomics, NPPSFA, 81.5
1fse_A74 GERE; helix-turn-helix DNA-binding protein transcr 80.8
>1tc3_C Protein (TC3 transposase); DNA binding, helix-turn-helix, TC1/mariner family, complex (transposase/DNA), DNA binding protein/DNA complex; HET: DNA; 2.45A {Caenorhabditis elegans} SCOP: a.4.1.2 Back     alignment and structure
Probab=93.14  E-value=0.02  Score=40.19  Aligned_cols=41  Identities=12%  Similarity=0.110  Sum_probs=32.7

Q ss_pred             CCCChhhhhhhhhhccccCCCccchhhccCccccceehhhhh
Q 041521          228 DAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLVLE  269 (524)
Q Consensus       228 ~~l~~e~rL~i~L~~LatG~s~~~la~~FgiS~STvsriv~~  269 (524)
                      ..++.+++..+... +..|.++..||..+|||.+||.+++.+
T Consensus         4 ~~l~~~~~~~i~~~-~~~g~s~~~IA~~lgis~~Tv~~~~~~   44 (51)
T 1tc3_C            4 SALSDTERAQLDVM-KLLNVSLHEMSRKISRSRHCIRVYLKD   44 (51)
T ss_dssp             CCCCHHHHHHHHHH-HHTTCCHHHHHHHHTCCHHHHHHHHHC
T ss_pred             CCCCHHHHHHHHHH-HHcCCCHHHHHHHHCcCHHHHHHHHhh
Confidence            46777777555444 468999999999999999999988764



>2p7v_B Sigma-70, RNA polymerase sigma factor RPOD; RSD, regulator of sigma 70, sigma 70 domain 4, transcription, regulation, helix-turn-helix; 2.60A {Escherichia coli} SCOP: a.4.13.2 Back     alignment and structure
>2glo_A Brinker CG9653-PA; protein-DNA complex, helix-turn-helix motif, transcription/DNA complex; NMR {Drosophila melanogaster} Back     alignment and structure
>2w7n_A TRFB transcriptional repressor protein; INCP, plasmid, repressor, DNA-binding, transcription/DNA; HET: BRU; 1.85A {Escherichia coli} Back     alignment and structure
>2jn6_A Protein CGL2762, transposase; GFT PSI-2, protein structure, structural genomics, protein structure initiative; NMR {Corynebacterium glutamicum} SCOP: a.4.1.19 Back     alignment and structure
>2o8x_A Probable RNA polymerase sigma-C factor; promoter recognition, transcription regulation, helix-turn-H motif, transcription; 3.00A {Mycobacterium tuberculosis} Back     alignment and structure
>1ku3_A Sigma factor SIGA; helix-turn-helix, transcription; 1.80A {Thermus aquaticus} SCOP: a.4.13.2 PDB: 1ku7_A 1rio_H 3n97_A* Back     alignment and structure
>1jko_C HIN recombinase, DNA-invertase HIN; water-mediated recognition, protein-DNA complex, A10G mutant, DNA binding protein/DNA complex; 2.24A {Synthetic} SCOP: a.4.1.2 PDB: 1ijw_C* 1jj6_C* 1jj8_C* 1hcr_A 1jkp_C 1jkq_C 1jkr_C Back     alignment and structure
>1tty_A Sigma-A, RNA polymerase sigma factor RPOD; helix-turn-helix, transcription; NMR {Thermotoga maritima} SCOP: a.4.13.2 Back     alignment and structure
>1wy3_A Villin; structural protein; HET: NLE; 0.95A {Synthetic} PDB: 1wy4_A 1yri_A* 1yrf_A* 2f4k_A* 1vii_A 3trv_A* 3trw_A 3tjw_B* 3trv_B* 3try_A* 2ppz_A 2jm0_A* 3tjw_A* 3iur_B* Back     alignment and structure
>3hug_A RNA polymerase sigma factor; ECF sigma factor, zinc binding anti-sigma factor, oxidative transcription regulation; 2.35A {Mycobacterium tuberculosis} Back     alignment and structure
>1und_A Advillin, P92; actin binding, F-actin binding, cytoskeleton, headpiece subdomain; NMR {Homo sapiens} SCOP: a.14.1.1 Back     alignment and structure
>1jhg_A Trp operon repressor; complex (regulatory protein-peptide), DNA-binding regulatory complex (regulatory protein-peptide) complex; HET: TRP; 1.30A {Escherichia coli} SCOP: a.4.12.1 PDB: 1co0_A* 1mi7_R 1p6z_R 1wrp_R* 1zt9_A* 2oz9_R* 3ssw_R 3wrp_A 1rcs_A* 1wrs_R* 1wrt_R 2xdi_A 3ssx_R* 1trr_A* 1tro_A* Back     alignment and structure
>3t72_q RNA polymerase sigma factor RPOD, DNA-directed RN polymerase subunit beta; winged-helix motif, transcription activation, DNA-binding; 4.33A {Escherichia coli} PDB: 1tlh_B Back     alignment and structure
>1u78_A TC3 transposase, transposable element TC3 transposase; transposon DNA, bipartite DNA-binding, HTH- motif, DNA binding protein/DNA complex; 2.69A {Caenorhabditis elegans} SCOP: a.4.1.2 a.4.1.2 Back     alignment and structure
>2elh_A CG11849-PA, LD40883P; structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Drosophila melanogaster} Back     alignment and structure
>1fse_A GERE; helix-turn-helix DNA-binding protein transcriptional regulat transcription; 2.05A {Bacillus subtilis} SCOP: a.4.6.2 Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query524
d1ijwc_47 HIN recombinase (DNA-binding domain) {Synthetic} 94.51
d1hlva166 DNA-binding domain of centromere binding protein B 91.08
d1rp3a271 Sigma factor sigma-28 (FliA) {Aquifex aeolicus [Ta 90.91
d1or7a168 SigmaE factor (RpoE) {Escherichia coli [TaxId: 562 88.75
d1unda_36 Advillin {Human (Homo sapiens) [TaxId: 9606]} 85.5
d1yu8x135 Villin {Chicken (Gallus gallus) [TaxId: 9031]} 84.82
d1s7oa_106 Hypothetical protein SPy1201 {Streptococcus pyogen 83.75
d2jn6a189 Uncharacterized protein Cgl2762 {Corynebacterium g 83.45
>d1ijwc_ a.4.1.2 (C:) HIN recombinase (DNA-binding domain) {Synthetic} Back     information, alignment and structure
class: All alpha proteins
fold: DNA/RNA-binding 3-helical bundle
superfamily: Homeodomain-like
family: Recombinase DNA-binding domain
domain: HIN recombinase (DNA-binding domain)
species: Synthetic
Probab=94.51  E-value=0.0031  Score=44.31  Aligned_cols=39  Identities=26%  Similarity=0.352  Sum_probs=33.6

Q ss_pred             CCCChhhhhhhhhhccccCCCccchhhccCccccceehhh
Q 041521          228 DAIPVRQRVAVCVWRLATGEPLRVVSKRFGLGISTCHKLV  267 (524)
Q Consensus       228 ~~l~~e~rL~i~L~~LatG~s~~~la~~FgiS~STvsriv  267 (524)
                      +.++. +++..+...++.|.+..+|+..||||.+|++|++
T Consensus         4 ~~lt~-~q~~~a~~l~~~G~s~~~iA~~~gVSr~TiYry~   42 (47)
T d1ijwc_           4 RAINK-HEQEQISRLLEKGHPRQQLAIIFGIGVSTLYRYF   42 (47)
T ss_dssp             CSSCH-HHHHHHHHHHHTTCCHHHHHHHHTCCHHHHHHHS
T ss_pred             CcCCH-HHHHHHHHHHHCCCCHHHHHHHHCcCHHHHHhhC
Confidence            45554 6777888889999999999999999999999986



>d1hlva1 a.4.1.7 (A:1-66) DNA-binding domain of centromere binding protein B (CENP-B) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1rp3a2 a.4.13.2 (A:164-234) Sigma factor sigma-28 (FliA) {Aquifex aeolicus [TaxId: 63363]} Back     information, alignment and structure
>d1or7a1 a.4.13.2 (A:120-187) SigmaE factor (RpoE) {Escherichia coli [TaxId: 562]} Back     information, alignment and structure
>d1unda_ a.14.1.1 (A:) Advillin {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1yu8x1 a.14.1.1 (X:42-76) Villin {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
>d1s7oa_ a.4.13.3 (A:) Hypothetical protein SPy1201 {Streptococcus pyogenes [TaxId: 1314]} Back     information, alignment and structure
>d2jn6a1 a.4.1.19 (A:1-89) Uncharacterized protein Cgl2762 {Corynebacterium glutamicum [TaxId: 1718]} Back     information, alignment and structure