Citrus Sinensis ID: 035734


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-----
MPKPDCEHIRDMMKKKKMMILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKKH
ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHcccccccccccccccccccHHHHHHHHHHHccHHHHHHHHHccccccccccHHHHHHHHHHHHHHHHHHHHHHHcccEEEEEEccEEccccccEEEEEEEEEcccccEEEEEEEEEEcccccHHHHHHHHHHHHHHcccccccEEEEEccccccccccHHHHHHHHHHHccccEEEccHHHHHHHHHHHHHHcc
cccccHHHHHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccHHHHHHHHHHHccHHHHHHHHHHcccccccccHHHHHHHHHHHHHHHHHHHHHHHccccEEEEEEccccccccccEEEEEEEEcccccEEEEEEEEEEcccccHHHHHHHHHHHHHHccccHHHcccccccccccccccHHHHHHHHHHHcccEEEEHHHHHHHHEEHHHHHHcc
MPKPDCEHIRDMMKKKKMMILVVNFNPFCWLRQRlafhghdeskdssnrGNFLELLRFLtdhnedinvvtfdnapgnlqmtSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKhglsmsrlrgqgydgasnmqgeFNGLkalilednkcpyfiHCFAHQLQLALISVAKKH
MPKPDCEHIRDMMKKKKMMILVVNFNPFCWLRQRLAFHGhdeskdssnRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGiehitsttsiSLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKKH
MPKPDCEHIRDmmkkkkmmILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKKH
***************KKMMILVVNFNPFCWLRQRLAFHG**********GNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLS********Y*****MQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISV****
***********MMKKKKMMILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKK*
MPKPDCEHIRDMMKKKKMMILVVNFNPFCWLRQRLAFHG********NRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKKH
****DCEHIRDMMKKKKMMILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKKH
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhooooo
SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MPKPDCEHIRDMMKKKKMMILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKKH
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query225 2.2.26 [Sep-21-2011]
Q5SVZ6 1142 Zinc finger MYM-type prot yes no 0.924 0.182 0.290 8e-20
O95789 1325 Zinc finger MYM-type prot no no 0.475 0.080 0.318 6e-06
O43422 761 52 kDa repressor of the i no no 0.813 0.240 0.230 4e-05
Q9CUX1 758 52 kDa repressor of the i no no 0.764 0.226 0.234 0.0006
Q4R6P1 573 Protein FAM200A OS=Macaca N/A no 0.662 0.260 0.225 0.0007
Q8TCP9 573 Protein FAM200A OS=Homo s no no 0.662 0.260 0.225 0.0008
>sp|Q5SVZ6|ZMYM1_HUMAN Zinc finger MYM-type protein 1 OS=Homo sapiens GN=ZMYM1 PE=2 SV=1 Back     alignment and function desciption
 Score = 97.1 bits (240), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 66/227 (29%), Positives = 114/227 (50%), Gaps = 19/227 (8%)

Query: 8   HIRDMMKKKKMMILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRF-LTDHNEDI 66
           H + +   KK + L++  N     +Q L   G+D+S  S N+GNFLELL     D  E+ 
Sbjct: 543 HSKQIEGNKKYLKLIIE-NILFLGKQCLPLRGNDQSVSSVNKGNFLELLEMRAKDKGEET 601

Query: 67  -----NVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDV-LFSLLINESHDI 120
                + V F N        S +I+ +I+  +  E    I+NE+ D   FS++ +E+ + 
Sbjct: 602 FRLMNSQVDFYN--------STQIQSDIIEIIKTEMLQDIVNEINDSSAFSIICDETINS 653

Query: 121 SVKEQMVVVLRYVDKNGYVV---ERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRG 177
           ++KEQ+ + +RY  K+   +   ERF+G       T   L   +     + G+ M ++ G
Sbjct: 654 AMKEQLSICVRYPQKSSKAILIKERFLGFVDTEEMTGTHLHRTIKTYLQQIGVDMDKIHG 713

Query: 178 QGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKK 224
           Q YD  +N++ +FN + A   ++     +IHC+AH L L++I   K+
Sbjct: 714 QAYDSTTNLKIKFNKIAAEFKKEEPRALYIHCYAHFLDLSIIRFCKE 760





Homo sapiens (taxid: 9606)
>sp|O95789|ZMYM6_HUMAN Zinc finger MYM-type protein 6 OS=Homo sapiens GN=ZMYM6 PE=2 SV=2 Back     alignment and function description
>sp|O43422|P52K_HUMAN 52 kDa repressor of the inhibitor of the protein kinase OS=Homo sapiens GN=PRKRIR PE=1 SV=2 Back     alignment and function description
>sp|Q9CUX1|P52K_MOUSE 52 kDa repressor of the inhibitor of the protein kinase OS=Mus musculus GN=Prkrir PE=2 SV=2 Back     alignment and function description
>sp|Q4R6P1|F200A_MACFA Protein FAM200A OS=Macaca fascicularis GN=FAM200A PE=2 SV=1 Back     alignment and function description
>sp|Q8TCP9|F200A_HUMAN Protein FAM200A OS=Homo sapiens GN=FAM200A PE=2 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query225
224126313 788 predicted protein [Populus trichocarpa] 0.866 0.247 0.630 4e-67
357490617 892 Zinc finger MYM-type protein [Medicago t 0.866 0.218 0.594 2e-63
15230069 505 General transcription factor 2-related z 0.866 0.386 0.579 2e-61
15221982 769 TTF-type zinc finger protein with HAT di 0.866 0.253 0.579 5e-61
8954063 811 Contains similarity to a transposable el 0.866 0.240 0.579 6e-61
77550806 876 hAT family dimerisation domain containin 0.857 0.220 0.544 9e-59
242080863 776 hypothetical protein SORBIDRAFT_07g00579 0.866 0.251 0.553 1e-58
242080751 635 hypothetical protein SORBIDRAFT_07g00480 0.866 0.307 0.533 3e-57
19225003 811 putative transposase [Oryza sativa Japon 0.857 0.237 0.549 3e-57
116317825 793 OSIGBa0126J24.6 [Oryza sativa Indica Gro 0.866 0.245 0.517 4e-57
>gi|224126313|ref|XP_002319808.1| predicted protein [Populus trichocarpa] gi|222858184|gb|EEE95731.1| predicted protein [Populus trichocarpa] Back     alignment and taxonomy information
 Score =  260 bits (664), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 123/195 (63%), Positives = 156/195 (80%)

Query: 31  LRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIV 90
           L Q L F GHDE + SSN+GN+LELL FL+ +NE I  VTF  AP + ++TS +I+K+I 
Sbjct: 203 LHQGLPFRGHDECECSSNQGNYLELLHFLSRNNEAIKRVTFSEAPRHNKLTSPDIQKDIT 262

Query: 91  SCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHIT 150
              A E TN+II ++GD LFS+LI+ES DIS+KEQM VVLRYVD NG+++ERF+GI+H+ 
Sbjct: 263 QAAAEEITNVIIKDLGDSLFSILIDESRDISIKEQMAVVLRYVDNNGHIIERFLGIQHVR 322

Query: 151 STTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCF 210
            TT+ SLK A++ LFSKHGLS+SRLRGQGYDGASNM+GEFNGLKALIL  N   Y++HCF
Sbjct: 323 DTTASSLKAAIEALFSKHGLSISRLRGQGYDGASNMRGEFNGLKALILNSNPSAYYVHCF 382

Query: 211 AHQLQLALISVAKKH 225
           AH+LQL L++V KKH
Sbjct: 383 AHRLQLTLVAVTKKH 397




Source: Populus trichocarpa

Species: Populus trichocarpa

Genus: Populus

Family: Salicaceae

Order: Malpighiales

Class:

Phylum: Streptophyta

Superkingdom: Eukaryota

>gi|357490617|ref|XP_003615596.1| Zinc finger MYM-type protein [Medicago truncatula] gi|355516931|gb|AES98554.1| Zinc finger MYM-type protein [Medicago truncatula] Back     alignment and taxonomy information
>gi|15230069|ref|NP_189620.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] gi|332644085|gb|AEE77606.1| General transcription factor 2-related zinc finger protein [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|15221982|ref|NP_173360.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] gi|332191703|gb|AEE29824.1| TTF-type zinc finger protein with HAT dimerization domain [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|8954063|gb|AAF82236.1|AC069143_12 Contains similarity to a transposable element Tip100 protein for transposase from Ipomoea purpurea gb|4063769 and is a member of the transmembrane 4 family PF|00335 [Arabidopsis thaliana] Back     alignment and taxonomy information
>gi|77550806|gb|ABA93603.1| hAT family dimerisation domain containing protein [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|242080863|ref|XP_002445200.1| hypothetical protein SORBIDRAFT_07g005790 [Sorghum bicolor] gi|241941550|gb|EES14695.1| hypothetical protein SORBIDRAFT_07g005790 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|242080751|ref|XP_002445144.1| hypothetical protein SORBIDRAFT_07g004800 [Sorghum bicolor] gi|241941494|gb|EES14639.1| hypothetical protein SORBIDRAFT_07g004800 [Sorghum bicolor] Back     alignment and taxonomy information
>gi|19225003|gb|AAL86479.1|AC077693_18 putative transposase [Oryza sativa Japonica Group] Back     alignment and taxonomy information
>gi|116317825|emb|CAH65861.1| OSIGBa0126J24.6 [Oryza sativa Indica Group] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query225
TAIR|locus:2829839 505 AT3G29763 "AT3G29763" [Arabido 0.866 0.386 0.579 2.5e-58
TAIR|locus:2202165 769 AT1G19260 [Arabidopsis thalian 0.866 0.253 0.579 5.1e-58
TAIR|locus:4515102803 592 AT2G06541 [Arabidopsis thalian 0.866 0.329 0.558 2.6e-54
TAIR|locus:2118430 664 AT4G09660 [Arabidopsis thalian 0.755 0.256 0.582 6.7e-49
TAIR|locus:2033523 496 AT1G41920 "AT1G41920" [Arabido 0.853 0.387 0.505 1.6e-47
TAIR|locus:2093847 522 AT3G29450 "AT3G29450" [Arabido 0.311 0.134 0.7 2.2e-44
TAIR|locus:2829849 536 AT3G29765 [Arabidopsis thalian 0.648 0.272 0.595 8.1e-44
TAIR|locus:5019474775 393 AT3G31402 "AT3G31402" [Arabido 0.364 0.208 0.512 6.8e-27
UNIPROTKB|E2R5I7 1073 ZMYM1 "Uncharacterized protein 0.844 0.177 0.297 1.5e-20
UNIPROTKB|Q5SW02 1067 ZMYM1 "Zinc finger MYM-type pr 0.84 0.177 0.301 2.5e-20
TAIR|locus:2829839 AT3G29763 "AT3G29763" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
 Score = 599 (215.9 bits), Expect = 2.5e-58, P = 2.5e-58
 Identities = 113/195 (57%), Positives = 150/195 (76%)

Query:    31 LRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIV 90
             L Q L F GHDES++S+N+GNFLELL+++   NE +  V   NAP N QMTS  I+K+IV
Sbjct:    53 LHQGLPFRGHDESEESTNKGNFLELLKYIAGQNEVVKKVVLKNAPKNNQMTSPPIQKDIV 112

Query:    91 SCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHIT 150
              C + E T  II EM + +F LL++ES D S KEQM VV R+VDK G V ERFIG+ H+ 
Sbjct:   113 HCFSEEVTRSIIEEMDNDVFGLLVDESADASDKEQMAVVFRFVDKYGVVKERFIGVIHVQ 172

Query:   151 STTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCF 210
              T+S+SLK A+D LF+K+GLS+ +LRG+GYDGASNM+G+FNGL++LIL++N   Y++HCF
Sbjct:   173 ETSSLSLKSAIDSLFAKYGLSLKKLRGKGYDGASNMKGKFNGLRSLILKENSSAYYVHCF 232

Query:   211 AHQLQLALISVAKKH 225
             AHQLQL +++VAKKH
Sbjct:   233 AHQLQLVVMAVAKKH 247




GO:0003674 "molecular_function" evidence=ND
GO:0005634 "nucleus" evidence=ISM
GO:0008150 "biological_process" evidence=ND
TAIR|locus:2202165 AT1G19260 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:4515102803 AT2G06541 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2118430 AT4G09660 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2033523 AT1G41920 "AT1G41920" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2093847 AT3G29450 "AT3G29450" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:2829849 AT3G29765 [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
TAIR|locus:5019474775 AT3G31402 "AT3G31402" [Arabidopsis thaliana (taxid:3702)] Back     alignment and assigned GO terms
UNIPROTKB|E2R5I7 ZMYM1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|Q5SW02 ZMYM1 "Zinc finger MYM-type protein 1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by Ezypred Server ?

Fail to connect to Ezypred Server

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins

Functionally Associated Proteins Detected by STRING ?

Your Input:
eugene3.00130851
hypothetical protein (788 aa)
(Populus trichocarpa)
Predicted Functional Partners:
 
Sorry, there are no predicted associations at the current settings.
 

Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query225
pfam14291179 pfam14291, DUF4371, Domain of unknown function (DU 1e-47
>gnl|CDD|222655 pfam14291, DUF4371, Domain of unknown function (DUF4371) Back     alignment and domain information
 Score =  154 bits (391), Expect = 1e-47
 Identities = 71/184 (38%), Positives = 90/184 (48%), Gaps = 61/184 (33%)

Query: 9   IRDMMKKKKMMILVVNFNPFCWLRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINV 68
                 +    I    F     LRQ LAF GHDES++S N+GNF                
Sbjct: 57  KAQYKARLNYSIDCARF----LLRQGLAFRGHDESEESLNKGNF---------------- 96

Query: 69  VTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVV 128
                              E++  +A                        D+S KEQM V
Sbjct: 97  ------------------LELLEWLA-----------------------EDVSQKEQMAV 115

Query: 129 VLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLFSKHGLSMSRLRGQGYDGASNMQG 188
           VLRYVDK G V ERF+G+ H+  TTS++LK A+D L +KHGLS+S++RGQGYDGASNM+G
Sbjct: 116 VLRYVDKKGQVKERFLGVVHVEETTSLTLKSAIDSLLAKHGLSLSKVRGQGYDGASNMKG 175

Query: 189 EFNG 192
           EFNG
Sbjct: 176 EFNG 179


Length = 179

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 225
PF14291235 DUF4371: Domain of unknown function (DUF4371) 100.0
PF04937153 DUF659: Protein of unknown function (DUF 659); Int 99.67
KOG1121 641 consensus Tam3-transposase (Ac family) [Replicatio 99.32
PF00872 381 Transposase_mut: Transposase, Mutator family; Inte 84.5
>PF14291 DUF4371: Domain of unknown function (DUF4371) Back     alignment and domain information
Probab=100.00  E-value=3.2e-48  Score=321.24  Aligned_cols=185  Identities=41%  Similarity=0.615  Sum_probs=177.0

Q ss_pred             hHHHHHHHHHhHHHHHHHHHHHHH-hhcCcccccCCCCCCCCCcccHHHHHHhhhccCcchhhhhhccCCCcccccChhH
Q 035734            7 EHIRDMMKKKKMMILVVNFNPFCW-LRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEI   85 (225)
Q Consensus         7 ~~~~~~~~~~~~~~l~~~~~~~~~-~~~~lp~rg~~~~~~~~~~gnf~~l~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~   85 (225)
                      ...++...++||++|++++++++| |+||||||||++..++.|+|||++|+++++++||.+++|+...++.+..|+++++
T Consensus        50 ~~~~~~~~~~nR~~L~~ii~~i~fL~~QgLa~RGh~e~~~s~n~GNFl~ll~l~~~~d~~l~~~~~~~~~~~~~~~s~~i  129 (235)
T PF14291_consen   50 QKQRKEEIEENRQYLKRIIDVILFLARQGLAFRGHDESEDSLNNGNFLELLELLAKYDPELKKHLSKNAPKNAKYSSKTI  129 (235)
T ss_pred             HHHhHHHHHHHHHHHHHHHHHHHHHHhcccccccCCccccccccccHHHHHHHHHhhcccchhhhhcccccceeccHHHH
Confidence            445667788999999999999999 9999999999999999999999999999999999999998788889999999999


Q ss_pred             HHHHHHHHHHHHHHHHHHhhccceeEEEecccCCCcchhhheeeeeeeccCceeeeeeeceeecccCCHHHHHHHHHHHH
Q 035734           86 KKEIVSCVAIETTNIIINEMGDVLFSLLINESHDISVKEQMVVVLRYVDKNGYVVERFIGIEHITSTTSISLKEALDRLF  165 (225)
Q Consensus        86 ~~~~i~~~~~~~~~~i~~~l~~~~fSi~~D~~td~~~~~~l~v~vr~vd~~~~~~e~~L~~~~l~~~tae~i~~~i~~~L  165 (225)
                      ++++ +.++..+++.|.+++++++|||++||+||+++.+||+|++||||.++.++|+||||.++++.||++|++.|.++|
T Consensus       130 q~~i-~~~a~~v~~~I~~~v~~~~FSii~DettDis~~eQl~i~vRyv~~~~~i~E~Fl~f~~~~~~ta~~l~~~i~~~L  208 (235)
T PF14291_consen  130 QNEI-EILADHVRQSIVEEVKSKYFSIIVDETTDISNKEQLSICVRYVDKDGKIKERFLGFVELEDTTAESLFNAIKDVL  208 (235)
T ss_pred             HHHH-HHHHHHHHHHHHhhccccceeeeeeccccccccchhhheeeeeccCcceeeeeeeeeccCCccHHHHHHHHHHHH
Confidence            9998 999999999999999989999999999999999999999999998899999999999999999999999999999


Q ss_pred             hhCCCCCCCeEEEeccCccccccchhH
Q 035734          166 SKHGLSMSRLRGQGYDGASNMQGEFNG  192 (225)
Q Consensus       166 ~~~~i~~~~~~~~~~Dgas~m~g~~~g  192 (225)
                      +++||++++|+||+|||||+|+|..+|
T Consensus       209 ~~~~l~~~~~~gq~yDgas~M~G~~~G  235 (235)
T PF14291_consen  209 EKLGLDLSNCRGQCYDGASNMSGKHNG  235 (235)
T ss_pred             HHcCCCHHHcCcccccChHhheeccCC
Confidence            999999999999999999999999886



>PF04937 DUF659: Protein of unknown function (DUF 659); InterPro: IPR007021 These are transposase-like proteins with no known function Back     alignment and domain information
>KOG1121 consensus Tam3-transposase (Ac family) [Replication, recombination and repair] Back     alignment and domain information
>PF00872 Transposase_mut: Transposase, Mutator family; InterPro: IPR001207 Autonomous mobile genetic elements such as transposon or insertion sequences (IS) encode an enzyme, transposase, that is required for excising and inserting the mobile element Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query225
2bw3_A 534 Transposase; DNA recombination, transposition; 2.0 99.93
>2bw3_A Transposase; DNA recombination, transposition; 2.0A {Musca domestica} SCOP: a.270.1.1 c.55.3.12 PDB: 2bw3_B Back     alignment and structure
Probab=99.93  E-value=1.3e-25  Score=204.54  Aligned_cols=175  Identities=17%  Similarity=0.187  Sum_probs=146.6

Q ss_pred             HHHH-hhcCcccccCCCCCCCCCcccHHHHHHhhhccCcchhhhhhccCCCcccccChhHHHHHHHHHHHHHHHHHHHhh
Q 035734           27 PFCW-LRQRLAFHGHDESKDSSNRGNFLELLRFLTDHNEDINVVTFDNAPGNLQMTSNEIKKEIVSCVAIETTNIIINEM  105 (225)
Q Consensus        27 ~~~~-~~~~lp~rg~~~~~~~~~~gnf~~l~~l~~~~~~~~~~~~~~~~~~~~~~~s~~~~~~~i~~~~~~~~~~i~~~l  105 (225)
                      +++| +++++||+..+.       ++|+.+++.+.+.+|.+.+.+.    .+...+|+......+..++..+++++++++
T Consensus        23 l~~~i~~~~~Pf~~ve~-------~~F~~~l~~l~~~~p~~~~~~~----~~~~~ps~~~~~~~i~~~~~~~~~~l~~~l   91 (534)
T 2bw3_A           23 CAQWVVRDCRPFSAVSG-------SGFIDMIKFFIKVKAEYGEHVN----VEELLPSPITLSRKVTSDAKEKKALIGREI   91 (534)
T ss_dssp             HHHHHHHTTCCGGGGGC-------HHHHHHHHHHHHHHHHHCSCBC----HHHHSCCHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHhcCCcchhhCC-------HHHHHHHHHHHhhhhhhccccc----hhhcCCCcccHHHHHHHHHHHHHHHHHHHH
Confidence            4888 999999998774       4999999999877776644221    122345666666666677888999999999


Q ss_pred             cc----ceeEEEecccCCC-cchhhheeeeeeeccCceeeeeeeceeeccc--CCHHHHHHHHHHHHhhCCCCCCCeEEE
Q 035734          106 GD----VLFSLLINESHDI-SVKEQMVVVLRYVDKNGYVVERFIGIEHITS--TTSISLKEALDRLFSKHGLSMSRLRGQ  178 (225)
Q Consensus       106 ~~----~~fSi~~D~~td~-~~~~~l~v~vr~vd~~~~~~e~~L~~~~l~~--~tae~i~~~i~~~L~~~~i~~~~~~~~  178 (225)
                      +.    ++|||++|+|||. .+.+||+|++||+ ++|.+++++|++.++++  +||++|++.|.+++++|||+..+++++
T Consensus        92 ~~~~~~~~~sl~~D~wt~~~~~~~~l~i~v~~i-~~~~~~~~~L~~~~~~~~~~t~~~i~~~i~~~l~~~~l~~~~~v~~  170 (534)
T 2bw3_A           92 KSAVEKDGASATIDLWTDNYIKRNFLGVTLHYH-ENNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSIKF  170 (534)
T ss_dssp             HHHHHTTCCEEEEEEEEETTTTEEEEEEEEEEE-ETTEEEEEEEEEEEECGGGCSHHHHHHHHHHHHHTTTCCCCTTCEE
T ss_pred             HhhhcCCceEEEEecccCCCCCcceeEEEEEEe-eCCceEEEEEeEeecCCCCCCHHHHHHHHHHHHHHcCCCcccceeE
Confidence            86    6899999999998 5789999999999 58999999999999984  999999999999999999985567778


Q ss_pred             eccCccccccchhHHHHHHHHhCCCcEEecchhHHHHHHHHHHhhh
Q 035734          179 GYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKK  224 (225)
Q Consensus       179 ~~Dgas~m~g~~~gl~~~l~~~~p~~~~~~C~aH~lnL~v~~~~~~  224 (225)
                      ++|||++|.++.++           ..+++|++|.+||++++++++
T Consensus       171 vtDnasn~~~~~~~-----------~~~i~C~~H~lnLi~~~~l~~  205 (534)
T 2bw3_A          171 VTDRGANVVKSLAN-----------NIRINCSSHLLSNVLENSFEE  205 (534)
T ss_dssp             EECCCHHHHHHTTT-----------SEEEECHHHHHHHHHHHHHHH
T ss_pred             ECCCHHHHHHHHhc-----------CcccccHHHHHHHHHHHHhhh
Confidence            88999999877553           379999999999999998764




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 225
d2bw3a2 447 c.55.3.12 (A:163-609) Transposase Hermes, catalyti 6e-08
>d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} Length = 447 Back     information, alignment and structure

class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Hermes transposase-like
domain: Transposase Hermes, catalytic domain
species: House fly (Musca domestica) [TaxId: 7370]
 Score = 50.1 bits (118), Expect = 6e-08
 Identities = 16/110 (14%), Positives = 42/110 (38%), Gaps = 14/110 (12%)

Query: 117 SHDISVKEQMVVVLRYVDKNGYVVERFIGIEHIT--STTSISLKEALDRLFSKHGLSMSR 174
           + +   +  + V L Y + N  + +  +G++ +    +T+ ++ + L  +FS+  +    
Sbjct: 21  TDNYIKRNFLGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLS 79

Query: 175 LRGQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKK 224
                 D  +N+                    I+C +H L   L +  ++
Sbjct: 80  SIKFVTDRGANVVKSLANN-----------IRINCSSHLLSNVLENSFEE 118


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query225
d2bw3a2 447 Transposase Hermes, catalytic domain {House fly (M 99.86
>d2bw3a2 c.55.3.12 (A:163-609) Transposase Hermes, catalytic domain {House fly (Musca domestica) [TaxId: 7370]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Hermes transposase-like
domain: Transposase Hermes, catalytic domain
species: House fly (Musca domestica) [TaxId: 7370]
Probab=99.86  E-value=3.9e-22  Score=172.92  Aligned_cols=113  Identities=18%  Similarity=0.161  Sum_probs=99.6

Q ss_pred             HHHHhhccceeEEEecccCC-CcchhhheeeeeeeccCceeeeeeeceeecc-c-CCHHHHHHHHHHHHhhCCCCCCCeE
Q 035734          100 IIINEMGDVLFSLLINESHD-ISVKEQMVVVLRYVDKNGYVVERFIGIEHIT-S-TTSISLKEALDRLFSKHGLSMSRLR  176 (225)
Q Consensus       100 ~i~~~l~~~~fSi~~D~~td-~~~~~~l~v~vr~vd~~~~~~e~~L~~~~l~-~-~tae~i~~~i~~~L~~~~i~~~~~~  176 (225)
                      +|++.+....+|+++|+||+ .++.+||+|++||++ ++.+++++||+.+++ + +||++|++.|.+++++|||++.+++
T Consensus         3 ~~k~~~~~~~~s~~~D~Wts~~~~~~~l~v~~~yi~-~~~~~~~~L~~~~~~~~~~tg~~i~~~l~~~l~~~~i~~~~~~   81 (447)
T d2bw3a2           3 EIKSAVEKDGASATIDLWTDNYIKRNFLGVTLHYHE-NNELRDLILGLKSLDFERSTAENIYKKLKAIFSQFNVEDLSSI   81 (447)
T ss_dssp             HHHHHHHTTCCEEEEEEEEETTTTEEEEEEEEEEEE-TTEEEEEEEEEEEECGGGCSHHHHHHHHHHHHHTTTCCCCTTC
T ss_pred             hHHHHHHcCCEEEEEEeecCCCCCeEEEEEEEEEEE-CCceeEEEEEEEEeCCCCCcHHHHHHHHHHHHHHcCCCcceEE
Confidence            34555556789999999997 456799999999997 589999999999986 3 8999999999999999999999999


Q ss_pred             EEeccCccccccchhHHHHHHHHhCCCcEEecchhHHHHHHHHHHhhh
Q 035734          177 GQGYDGASNMQGEFNGLKALILEDNKCPYFIHCFAHQLQLALISVAKK  224 (225)
Q Consensus       177 ~~~~Dgas~m~g~~~gl~~~l~~~~p~~~~~~C~aH~lnL~v~~~~~~  224 (225)
                      ++++|||++|.+.           .+...++||++|+|||++.+++++
T Consensus        82 ~~~~D~a~n~~~~-----------~~~~~~i~C~aH~Lnl~v~d~~~~  118 (447)
T d2bw3a2          82 KFVTDRGANVVKS-----------LANNIRINCSSHLLSNVLENSFEE  118 (447)
T ss_dssp             EEEECCCHHHHHH-----------TTTSEEEECHHHHHHHHHHHHHHH
T ss_pred             EEEcCCcHhhHHH-----------hhccccccchhhhhhhhhHHHHhh
Confidence            9999999999764           456789999999999999998864