Psyllid ID: psy5177


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210---
MMGTACLTAYTDYWNPDSLVPPPKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESSRRKFLYRVSRERLYNA
cccccccccccccccccccccccEEEEEccccccccccccccccEEEEEEEccccccccccEEEEcccccccccccccccccEEEEccccccccHHHHHHHHHccccccccccccEEEEEccccccccHHHHHHHHHcccEEEEEccccccccccccccccHHHHHHHHHHHHHHHHHccccccHHHHHHHHHHHHHHHHHcccccccccccc
cccHHHHHccHccccHHHHHcccEEEEcccccccccccccccccEEEEEEEEcccccccccEEEEcccHHHHHHHcccccccEEEEccccccccHHHHHHHHHcccccccccccEEEEEEccccccccHHHHHHHHHccEEEEEccccccccccccccccHHHHHHHHHHHHHHHHHHcccccccccccccccHHHHHHHccHHHHHHHHccc
MMGTACLTAytdywnpdslvpppkvvAKKRTKKLGQAASAERGELVTMIGIIrasgeaippvfvfprvfmkdefllgapegslgfanksgwssTEIFFETIKHIQNQIKaskenpalilmdnhethQSMEAIEYCRASgitlltfpphktdelqpldvavfsPFKNRLKAGFkkwtsdnpgsswplmamwpkisessrRKFLYRVSRERLYNA
MMGTACLTaytdywnpdslvpppkvvAKKRTKKlgqaasaergeLVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGfkkwtsdnpgsswplmamwpkisessrrkflyrvsrerlyna
MMGTACLTAYTDYWNPDSLVPPPKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESSRRKFLYRVSRERLYNA
****ACLTAYTDYWNPDSLV**********************GELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKI*******FLY**********
**GTACLTAYTDYWNPDSLVPPPKVVAK*****************VTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESSRRKFLYRVSRE*LY**
MMGTACLTAYTDYWNPDSLVPPPKVV**************ERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESSRRKFLYRVSRERLYNA
*MGTACLTAYTDYWNPDSLVPPPKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESSRRKFLYRVSRERLYN*
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhoooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MMGTACLTAYTDYWNPDSLVPPPKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESSRRKFLYRVSRERLYNA
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query213 2.2.26 [Sep-21-2011]
Q7Z3K31410 Pogo transposable element yes N/A 0.553 0.083 0.306 0.0002
Q8BZH41409 Pogo transposable element yes N/A 0.553 0.083 0.298 0.0005
>sp|Q7Z3K3|POGZ_HUMAN Pogo transposable element with ZNF domain OS=Homo sapiens GN=POGZ PE=1 SV=2 Back     alignment and function desciption
 Score = 45.8 bits (107), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 38/124 (30%), Positives = 54/124 (43%), Gaps = 6/124 (4%)

Query: 52   IRASGEAIPPVFVFPRVFMKDEFLLGAPEGSLGFANKSGWSSTEIFFETIKHIQNQIKAS 111
            I A G  +P   VF R  M        P+  L  A +SG+S  EI       +  +  A 
Sbjct: 1163 ILADGTVLP-TLVFYRGQMDQP--ANMPDSILLEAKESGYSDDEIMELWSTRVWQKHTAC 1219

Query: 112  KENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAG 171
            + +  +++MD H TH S E +    AS       P   + ++QPLDV +    KN L   
Sbjct: 1220 QRSKGMLVMDCHRTHLSEEVLAMLSASSTLPAVVPAGCSSKIQPLDVCIKRTVKNFL--- 1276

Query: 172  FKKW 175
             KKW
Sbjct: 1277 HKKW 1280




Plays a role in mitotic cell cycle progression and is involved in kinetochore assembly and mitotic sister chromatid cohesion. Probably through its association with CBX5 plays a role in mitotic chromosome segregation by regulating aurora kinase B/AURKB activation and AURKB and CBX5 dissociation from chromosome arms.
Homo sapiens (taxid: 9606)
>sp|Q8BZH4|POGZ_MOUSE Pogo transposable element with ZNF domain OS=Mus musculus GN=Pogz PE=1 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query213
357618507 764 hypothetical protein KGM_18910 [Danaus p 0.849 0.236 0.486 6e-50
427791397 487 Putative tick transposon, partial [Rhipi 0.746 0.326 0.522 3e-44
449692570 318 PREDICTED: uncharacterized protein LOC10 0.769 0.515 0.493 5e-44
449670659 1644 PREDICTED: uncharacterized protein LOC10 0.769 0.099 0.481 3e-43
449688819 419 PREDICTED: uncharacterized protein LOC10 0.769 0.391 0.481 2e-42
449676617 1115 PREDICTED: uncharacterized protein LOC10 0.769 0.147 0.481 3e-42
449666581 468 PREDICTED: uncharacterized protein LOC10 0.877 0.399 0.456 3e-42
449672758 1952 PREDICTED: kinesin family member 21A [Hy 0.877 0.095 0.451 4e-42
449690888 420 PREDICTED: uncharacterized protein LOC10 0.877 0.445 0.456 4e-42
443733699298 hypothetical protein CAPTEDRAFT_141083 [ 0.751 0.536 0.506 7e-42
>gi|357618507|gb|EHJ71459.1| hypothetical protein KGM_18910 [Danaus plexippus] Back     alignment and taxonomy information
 Score =  202 bits (515), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 90/185 (48%), Positives = 129/185 (69%), Gaps = 4/185 (2%)

Query: 18  SLVPPPKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLG 77
           +++  PKV+A ++ +++GQ  SAERGELVT  G+I A+G ++PPV+VFPRV  KD FL G
Sbjct: 252 TVLSTPKVIAGRKQRQVGQIVSAERGELVTFCGVITATGSSLPPVYVFPRVHYKDHFLNG 311

Query: 78  APEGSLGFANKSGWSSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRA 137
           AP+GSLG  N+SGW ++E+F   +KHIQ    ++K NP LI+ DNHE+H S+EA+ YCR 
Sbjct: 312 APDGSLGLVNRSGWMTSELFIRVLKHIQRLTSSNKNNPILIICDNHESHISIEAVNYCRD 371

Query: 138 SGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSDNPGSSWPLMAMWPKISESS 197
           +G+  L+ PPH + +LQPLDV VF PFK +LK  F  W   N G    ++ ++  I+E S
Sbjct: 372 NGMVYLSLPPHTSHKLQPLDVGVFGPFKGKLKIAFNDWHIQNVGK---ILTIY-SIAELS 427

Query: 198 RRKFL 202
           +  +L
Sbjct: 428 KLAYL 432




Source: Danaus plexippus

Species: Danaus plexippus

Genus: Danaus

Family: Nymphalidae

Order: Lepidoptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|427791397|gb|JAA61150.1| Putative tick transposon, partial [Rhipicephalus pulchellus] Back     alignment and taxonomy information
>gi|449692570|ref|XP_004213088.1| PREDICTED: uncharacterized protein LOC101237143, partial [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|449670659|ref|XP_004207313.1| PREDICTED: uncharacterized protein LOC101240164 [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|449688819|ref|XP_004211858.1| PREDICTED: uncharacterized protein LOC101241720, partial [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|449676617|ref|XP_002162876.2| PREDICTED: uncharacterized protein LOC100213942 [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|449666581|ref|XP_004206373.1| PREDICTED: uncharacterized protein LOC101237149 [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|449672758|ref|XP_002157234.2| PREDICTED: kinesin family member 21A [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|449690888|ref|XP_004212494.1| PREDICTED: uncharacterized protein LOC101235029, partial [Hydra magnipapillata] Back     alignment and taxonomy information
>gi|443733699|gb|ELU17960.1| hypothetical protein CAPTEDRAFT_141083 [Capitella teleta] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query213
ASPGD|ASPL0000031765 627 AN5233 [Emericella nidulans (t 0.563 0.191 0.314 2.1e-10
ASPGD|ASPL0000014032 461 AN11044 [Emericella nidulans ( 0.643 0.297 0.295 1.4e-08
ASPGD|ASPL0000052324 462 AN0975 [Emericella nidulans (t 0.643 0.296 0.295 1.4e-08
ASPGD|ASPL0000063833 526 AN7516 [Emericella nidulans (t 0.643 0.260 0.295 1.8e-08
ASPGD|ASPL0000080187 526 AN9505 [Emericella nidulans (t 0.643 0.260 0.295 1.8e-08
ASPGD|ASPL0000071149 554 AN4391 [Emericella nidulans (t 0.643 0.247 0.295 1.9e-08
ASPGD|ASPL0000008186 554 AN6559 [Emericella nidulans (t 0.643 0.247 0.295 1.9e-08
ASPGD|ASPL0000073891 555 AN8788 [Emericella nidulans (t 0.643 0.246 0.295 1.9e-08
CGD|CAL0005175 516 orf19.2866 [Candida albicans ( 0.699 0.288 0.271 2.4e-08
CGD|CAL0001980 516 orf19.3820 [Candida albicans ( 0.699 0.288 0.271 2.4e-08
ASPGD|ASPL0000031765 AN5233 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
 Score = 157 (60.3 bits), Expect = 2.1e-10, P = 2.1e-10
 Identities = 40/127 (31%), Positives = 66/127 (51%)

Query:    44 ELVTMIGIIRASGEAIPPVFVFPRVFMK--DEFLLGA-PEGSLGF--ANKSGWSSTEIFF 98
             E +T I  I A G  +PP  V+     K  D +L    PE    F  ++ SGW++ ++  
Sbjct:   194 ERITAIPCICADGSKLPPGLVYQSASSKIQDTWLQDFNPEYHNCFFTSSPSGWTNDDVCL 253

Query:    99 ETIKHIQNQIKASKENPA--LILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPL 156
               ++ + ++    K      L+L+D + +  +M+ ++YC  + I L T+PPH T  LQPL
Sbjct:   254 AWLRDVFDRETKPKARGRWRLLLLDGYGSFVTMKFLDYCEENKILLATYPPHSTHTLQPL 313

Query:   157 DVAVFSP 163
             DV + SP
Sbjct:   314 DVGIISP 320




GO:0008150 "biological_process" evidence=ND
GO:0003677 "DNA binding" evidence=IEA
GO:0005575 "cellular_component" evidence=ND
ASPGD|ASPL0000014032 AN11044 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
ASPGD|ASPL0000052324 AN0975 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
ASPGD|ASPL0000063833 AN7516 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
ASPGD|ASPL0000080187 AN9505 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
ASPGD|ASPL0000071149 AN4391 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
ASPGD|ASPL0000008186 AN6559 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
ASPGD|ASPL0000073891 AN8788 [Emericella nidulans (taxid:162425)] Back     alignment and assigned GO terms
CGD|CAL0005175 orf19.2866 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms
CGD|CAL0001980 orf19.3820 [Candida albicans (taxid:5476)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query213
pfam03184215 pfam03184, DDE_1, DDE superfamily endonuclease 2e-20
pfam13358146 pfam13358, DDE_3, DDE superfamily endonuclease 2e-04
>gnl|CDD|217410 pfam03184, DDE_1, DDE superfamily endonuclease Back     alignment and domain information
 Score = 84.7 bits (210), Expect = 2e-20
 Identities = 46/171 (26%), Positives = 73/171 (42%), Gaps = 12/171 (7%)

Query: 13  YWNPDSLVPPPKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVF----PRV 68
           +W    L+P       ++ +  G+     R     +I    A GE +PP+ +     PR 
Sbjct: 15  FWG---LLPNGTSSQPEKRRVPGKKPGKRR--WTVVICANAAGGEKLPPLVIGKGKNPRA 69

Query: 69  FMKDEFLLGAPEGSLGFANKSGWSSTEIFFE-TIKHIQNQIKASKENPALILMDNHETHQ 127
           F  ++     P      ANK+ W + EIF E   K    +++ +     L+++D H +H 
Sbjct: 70  FKFEKK--PKPLPWEYKANKNAWMTDEIFEEWLEKVFDPRMQPAPGRKVLLILDGHGSHP 127

Query: 128 SMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAGFKKWTSD 178
           + E +  C+ + I LL  P H T  LQPLD  VFS  K   +    K    
Sbjct: 128 TDEFMRSCKLNNIILLFLPAHSTSLLQPLDQGVFSSLKAAYRRQLLKKALA 178


This family of proteins are related to pfam00665 and are probably endonucleases of the DDE superfamily. Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which contain three carboxylate residues that are believed to be responsible for coordinating metal ions needed for catalysis. The catalytic activity of this enzyme involves DNA cleavage at a specific site followed by a strand transfer reaction. Interestingly this family also includes the CENP-B protein. This domain in that protein appears to have lost the metal binding residues and is unlikely to have endonuclease activity. Centromere Protein B (CENP-B) is a DNA-binding protein localised to the centromere. Length = 215

>gnl|CDD|222070 pfam13358, DDE_3, DDE superfamily endonuclease Back     alignment and domain information

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 213
PF03184217 DDE_1: DDE superfamily endonuclease; InterPro: IPR 100.0
PF13358146 DDE_3: DDE superfamily endonuclease 99.51
KOG3105|consensus488 99.21
KOG3105|consensus 488 98.62
COG3335132 Transposase and inactivated derivatives [DNA repli 94.3
PF00665120 rve: Integrase core domain; InterPro: IPR001584 In 91.61
PHA02517277 putative transposase OrfB; Reviewed 87.29
>PF03184 DDE_1: DDE superfamily endonuclease; InterPro: IPR004875 These proteins are probably endonucleases of the DDE superfamily Back     alignment and domain information
Probab=100.00  E-value=1.2e-40  Score=267.12  Aligned_cols=190  Identities=28%  Similarity=0.442  Sum_probs=151.6

Q ss_pred             CCCCCeecCCCCCCC-----CceEEeccccccccccc-CCCCceEEEE-eeeccCCcccCCeEEecccccccccc--cCC
Q psy5177           8 TAYTDYWNPDSLVPP-----PKVVAKKRTKKLGQAAS-AERGELVTMI-GIIRASGEAIPPVFVFPRVFMKDEFL--LGA   78 (213)
Q Consensus         8 ~~~~~I~N~DET~~~-----~~~~~~~g~k~v~~~~~-~~~~~~~Tvl-~~~~a~G~~lpP~~I~~~~~~~~~~~--~~~   78 (213)
                      |+|++|||+|||++.     .+++...+.+....+.. ...++++|++ +|++++|..+||++|++++.....+.  ...
T Consensus         1 y~~~~Ifn~DEt~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~t~~l~~~~a~g~~lpP~vI~~~~~~~~~~~~~~~~   80 (217)
T PF03184_consen    1 YSPSNIFNMDETGFFYGMPPKKTVVGRSKKNKKRVPGQKSDKRRVTTVLCCNSAGGEKLPPLVIFKGKRPRCFWNDVDDK   80 (217)
T ss_pred             CCHHHEEEeccceeeecCCCCceEEEEeecccccceeccCCCcccccceeeecccccccccccccccccccccccccccc
Confidence            789999999999974     45533333222222221 1114456555 55667788899999999988776665  456


Q ss_pred             CCCceeeecCCCCccHHHHHHHH-HHHHhhhhcCCCCCeEEeccCCCcccchHHHHHHHHcCcEEEeeCCCCCCCcCccc
Q psy5177          79 PEGSLGFANKSGWSSTEIFFETI-KHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLD  157 (213)
Q Consensus        79 ~~~~~~~~s~~gw~~~~~f~~wl-~~f~~~~~~~~~~~~lLllD~~~~H~~~~~~~~~~~~~i~l~~lP~~~t~~lQPlD  157 (213)
                      +.++.|..+++||||.++|++|| +.|.++++ ..++++|||+|++++|.+.++.++|++.||.+++||||+|+.+||||
T Consensus        81 ~~~~~~~~~~~g~~t~~~~~~wl~~~f~~~~~-~~~~~~lLllD~~~~h~~~~~~~~~~~~~i~l~~lP~~~t~~lQPld  159 (217)
T PF03184_consen   81 PLGVYYASSENGWMTTELFLEWLKKVFDPRVK-SPGRPVLLLLDSHSSHISPEFKQLCKSNNIILVFLPPNTTHLLQPLD  159 (217)
T ss_pred             cCCcccccccccccccccchhhhhhhhccccc-ccccccccccccccccccccchhhhhccccccceecccccccccccc
Confidence            78999999999999999999999 57999887 46899999999999999999999999999999999999999999999


Q ss_pred             ccccHHHHHHHHHHHHHHHhcCCCCC------chhhhhhhHhhHHHh
Q psy5177         158 VAVFSPFKNRLKAGFKKWTSDNPGSS------WPLMAMWPKISESSR  198 (213)
Q Consensus       158 ~~v~~~lK~~~~~~~~~~~~~~~~~~------~t~~~~~~~~~~~~~  198 (213)
                      +|+|++||..||+.+.+|+.+..+..      +++.+++..+..+|.
T Consensus       160 ~~v~~~~K~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~AW~  206 (217)
T PF03184_consen  160 VGVFGPFKAHYRKKLLQWIIENGNNPKDFLYKPTLLDAIRWISKAWN  206 (217)
T ss_pred             hhhhhhhhHHHHHHHHHHHHhcccccccccCCcCHHHHHHHHHHHHH
Confidence            99999999999999999998876543      456666555555554



Transposase proteins are necessary for efficient DNA transposition. This domain is a member of the DDE superfamily, which contain three carboxylate residues that are believed to be responsible for coordinating metal ions needed for catalysis. The catalytic activity of this enzyme involves DNA cleavage at a specific site followed by a strand transfer reaction. Interestingly this family also includes the CENP-B protein. This domain in that protein appears to have lost the metal binding residues and is unlikely to have endonuclease activity. Centromere Protein B (CENP-B) is a DNA-binding protein localised to the centromere []. ; GO: 0003676 nucleic acid binding

>PF13358 DDE_3: DDE superfamily endonuclease Back     alignment and domain information
>KOG3105|consensus Back     alignment and domain information
>KOG3105|consensus Back     alignment and domain information
>COG3335 Transposase and inactivated derivatives [DNA replication, recombination, and repair] Back     alignment and domain information
>PF00665 rve: Integrase core domain; InterPro: IPR001584 Integrase comprises three domains capable of folding independently and whose three-dimensional structures are known Back     alignment and domain information
>PHA02517 putative transposase OrfB; Reviewed Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

No hit with e-value below 0.005

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query213
2f7t_A227 MOS1 transposase; RNAse-H like fold, DDD motif, DN 99.18
3f2k_A226 Histone-lysine N-methyltransferase setmar; histone 99.07
3hot_A345 Transposable element mariner, complete CDS; protei 99.05
1cxq_A162 Avian sarcoma virus integrase; mixed beta-sheet su 91.47
3kks_A152 Integrase, IN; beta-strands flanked by alpha-helic 88.26
1c0m_A238 Protein (integrase); HIV, X-RAY crystallography, p 85.87
2x78_A200 Integrase; viral protein, DNA-directed DNA polyme 84.97
3hph_A219 Integrase, IN; protein-protein complex, tetramer, 83.78
>2f7t_A MOS1 transposase; RNAse-H like fold, DDD motif, DNA binding protein; 2.25A {Drosophila mauritiana} Back     alignment and structure
Probab=99.18  E-value=3.5e-11  Score=96.15  Aligned_cols=136  Identities=13%  Similarity=0.122  Sum_probs=89.4

Q ss_pred             CCCeecCCCCCCC------CceEEecccccccccccCCCCceEEEEeeeccCCcccCCeEEecccccccccccCCCCCce
Q psy5177          10 YTDYWNPDSLVPP------PKVVAKKRTKKLGQAASAERGELVTMIGIIRASGEAIPPVFVFPRVFMKDEFLLGAPEGSL   83 (213)
Q Consensus        10 ~~~I~N~DET~~~------~~~~~~~g~k~v~~~~~~~~~~~~Tvl~~~~a~G~~lpP~~I~~~~~~~~~~~~~~~~~~~   83 (213)
                      ..+|+..||+++.      .+.+..+|.+............++.+..|++++|.. .+.++                   
T Consensus        31 ~~~Ivf~DE~~f~~~~~~~~~~W~~~ge~~~~~~~~~~~~~~vmv~g~~~~~G~~-~~~~~-------------------   90 (227)
T 2f7t_A           31 LHRIVTGDEKWIFFVNPKRKKSYVDPGQPATSTARPNRFGKKTMLCVWWDQSGVI-YYELL-------------------   90 (227)
T ss_dssp             GGGEEEEEEEEEEBCCCC-----------------------EEEEEEEEETTEEE-EEEEC-------------------
T ss_pred             hhheEeccCCEEEecCccccccccCCCCCCcceeccccccCeeEEEEEEecCceE-EeEec-------------------
Confidence            4679999998752      455666776443222222223567888888888732 22221                   


Q ss_pred             eeecCCCCccHHHHHHHHHHHHhhhhcC-----CCCC-eEEeccCCCcccchHHHHHHHHcCcEEEeeCCCCCCCcCccc
Q psy5177          84 GFANKSGWSSTEIFFETIKHIQNQIKAS-----KENP-ALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLD  157 (213)
Q Consensus        84 ~~~s~~gw~~~~~f~~wl~~f~~~~~~~-----~~~~-~lLllD~~~~H~~~~~~~~~~~~~i~l~~lP~~~t~~lQPlD  157 (213)
                         .+.|-+|++.+.++|+.+.+.+.+.     .++. +++++||++.|.+..+.+++.+.++.++++||++.. |.|+|
T Consensus        91 ---~~~~~in~~~y~~~L~~l~~~~~~~~p~~~~~~~~~~~~~Dna~~H~~~~~~~~~~~~~~~~~~~p~ysPd-lnpie  166 (227)
T 2f7t_A           91 ---KPGETVNAARYQQQLINLNRALQRKRPEYQKRQHRVIFLHDNAPSHTARAVRDTLETLNWEVLPHAAYSPD-LAPSD  166 (227)
T ss_dssp             ---CTTCCCCHHHHHHHHHHHHHHHHHHCHHHHTTSSCCEEECCCCGGGGSHHHHHHHHHHTCEECCCCTTCGG-GCHHH
T ss_pred             ---CCCceecHHHHHHHHHHHHHHHHHhChHhhccCCceEEEECCCchhhhHHHHHHHHHhCcccccCCCCCCC-cCccc
Confidence               2357899999999998777765421     1233 899999999999999999999999999999999988 67999


Q ss_pred             ccccHHHHHHHH
Q psy5177         158 VAVFSPFKNRLK  169 (213)
Q Consensus       158 ~~v~~~lK~~~~  169 (213)
                      ..+++.+|..++
T Consensus       167 ~~~w~~lk~~l~  178 (227)
T 2f7t_A          167 YHLFASMGHALA  178 (227)
T ss_dssp             HTHHHHHHHHHH
T ss_pred             chhhHHHHHHHh
Confidence            778888887654



>3f2k_A Histone-lysine N-methyltransferase setmar; histone-lysine N-methyltransferase setmar, SET domain and mariner transposase fusion; 1.85A {Homo sapiens} PDB: 3k9k_A 3k9j_A Back     alignment and structure
>3hot_A Transposable element mariner, complete CDS; protein-DNA complex, synaptic complex, transposase, inverted DNA, DNA binding protein-DNA complex; HET: 5IU; 3.25A {Drosophila mauritiana} PDB: 3hos_A* Back     alignment and structure
>1cxq_A Avian sarcoma virus integrase; mixed beta-sheet surrounded by alpha-helices, transferase; HET: EPE; 1.02A {Avian sarcoma virus} SCOP: c.55.3.2 PDB: 1cxu_A* 1czb_A* 1cz9_A* 1asu_A* 1asw_A* 1asv_A 1a5v_A* 1a5w_A* 1a5x_A* 1vsf_A* 1vse_A* 3o4n_A* 1vsm_A 3o4q_A* 1vsd_A* 1vsh_A* 1vsi_A* 1vsj_A* 1vsk_A 1vsl_A Back     alignment and structure
>3kks_A Integrase, IN; beta-strands flanked by alpha-helices, DNA binding protein; 2.20A {Bovine immunodeficiency virus} PDB: 3kkr_A Back     alignment and structure
>1c0m_A Protein (integrase); HIV, X-RAY crystallography, protein structure, transferase; 2.53A {Rous sarcoma virus} SCOP: b.34.7.1 c.55.3.2 PDB: 1c1a_A Back     alignment and structure
>2x78_A Integrase; viral protein, DNA-directed DNA polyme nucleotidyltransferase, DNA integration, aspartyl protease, recombination, transferase; 2.00A {Human spumaretrovirus} PDB: 2x6n_A 2x6s_A 2x74_A 3dlr_A Back     alignment and structure
>3hph_A Integrase, IN; protein-protein complex, tetramer, DNA integration, endonucl magnesium, metal-binding, multifunctional enzyme; 2.64A {Maedi visna virus} PDB: 3hpg_A Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

No hit with e-value below 0.005

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query213
d1cxqa_147 Retroviral integrase, catalytic domain {Rous sarco 94.18
d1exqa_154 Retroviral integrase, catalytic domain {Human immu 86.97
>d1cxqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Rous sarcoma virus RSV [TaxId: 11886]} Back     information, alignment and structure
class: Alpha and beta proteins (a/b)
fold: Ribonuclease H-like motif
superfamily: Ribonuclease H-like
family: Retroviral integrase, catalytic domain
domain: Retroviral integrase, catalytic domain
species: Rous sarcoma virus RSV [TaxId: 11886]
Probab=94.18  E-value=0.21  Score=34.28  Aligned_cols=75  Identities=16%  Similarity=0.101  Sum_probs=49.3

Q ss_pred             ccHHHHHHHHHHHHhhhhcCCCCCeEEeccCCCcccchHHHHHHHHcCcEEEeeCCCCCCCcCcccccccHHHHHHHHHH
Q psy5177          92 SSTEIFFETIKHIQNQIKASKENPALILMDNHETHQSMEAIEYCRASGITLLTFPPHKTDELQPLDVAVFSPFKNRLKAG  171 (213)
Q Consensus        92 ~~~~~f~~wl~~f~~~~~~~~~~~~lLllD~~~~H~~~~~~~~~~~~~i~l~~lP~~~t~~lQPlD~~v~~~lK~~~~~~  171 (213)
                      .+......++..+....    +.|..+..|+-+...+.++.++|++.||...+.+|++... +..=.-.++.+|..++..
T Consensus        45 ~t~~~~~~~l~~~~~~~----g~p~~i~sDnG~~F~s~~~~~~~~~~gI~~~~~~p~~P~~-ng~vER~~~tlk~~~~~~  119 (147)
T d1cxqa_          45 VTSVAAQHHWATAIAVL----GRPKAIKTDNGSCFTSKSTREWLARWGIAHTTGIPGNSQG-QAMVERANRLLKDKIRVL  119 (147)
T ss_dssp             CCHHHHHHHHHHHHHHH----CCCSEEECCSCHHHHSHHHHHHHHHHTCEEECCCCC-----CHHHHHHHHHHHHHHHHH
T ss_pred             cchHHHHHHHHHHHHHh----CCCeEEEecccccccchhhhhhhhheeEEeeeeccchhhc-CceeehhhHHHHHHHHHH
Confidence            44555556665544433    5678999999999999999999999999999999987542 222223344455444443



>d1exqa_ c.55.3.2 (A:) Retroviral integrase, catalytic domain {Human immunodeficiency virus type 1 [TaxId: 11676]} Back     information, alignment and structure