Psyllid ID: psy3060


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------210-------220-------230-------240-------250-------260-------27
MHLTTENMNDNENFISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGREAEASPKDWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTKLLKSPFEKSEPVKLEFRDPGLKSYDATPN
ccccccccccccccHHHHHHHHHHHHHHHHcccccccccccccEEEEcccccccccccccccEEEEEEccccccEEEEEEEcccccEEEEccccEEEccccEEccccEEEEEcccccccccccccccccccEEEccccEEEEEEcccccccHHHHHHHHHHHHHHHHHHcccccccccccccccccccccccccccccEEccccccccccccccccccccccEEEEEEEEEEEEEEcEEEcccccccccccccccccccccccccccc
cccccccccccccccccccHHHccHHHHHHHcccccEEcccccEEEcccccEEEcccccccccEEEEccccccccEEEEEEEccccccEcEEccEEEEEccEEEccccEEcccccccccccccEEEcccHHHHHEHcccEEEEEEccccccHHHHcHHHHHHHHHHHHHcccEEEEcccccccccEEEEEcccccHHHEEEHccccccccccccccccccccHHHHHEEEEEEHHcHcccccccccccccEEEEEccccccccccccc
mhlttenmndnenfISAEVLAERTLDSLlaehpgeltrtgcphvvctvlpthwrsnktlpVAFKVVALGDVMDGTIVTIragndenycgelRNCTAVMKNQVakfndlrfvgrsgreaeaspkdWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLvpivpnhlskidiftggslCGLVLLThfssgarwdsasapsipisnvvHHTVRLFGyhihktkllkspfeksepvklefrdpglksydatpn
mhlttenmndneNFISAEVLAERTLDSLLAEHPgeltrtgcPHVVCTVLPthwrsnktlPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVgrsgreaeaspkdwvGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTkllkspfeksepvklefrdpglksydatpn
MHLTTENMNDNENFISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGREAEASPKDWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTKLLKSPFEKSEPVKLEFRDPGLKSYDATPN
**************ISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSG******PKDWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTKLL***************************
******************VLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGREAEASPKDWVGTLIWLIGIKNKHQQYEI***************************************************CGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTKLLKSPFEKSEPVKLEFR************
********NDNENFISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGREAEASPKDWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTKLLKSPFEKSEPVKLEFRDPGLKSYDATPN
****************AEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGREAEASPKDWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFS*********APSIPISNVVHHTVRLFGYHIHKTKLLKSPFEKSEPVKLEFRDPGLKSY*****
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiihhhhhhhhhhhhhhhhhhhhooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MHLTTENMNDNENFISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGREAEASPKDWVGTLIWLIGIKNKHQQYEIEISFLINQITASISELLRMYEKTLRLGHLILLVPIVPNHLSKIDIFTGGSLCGLVLLTHFSSGARWDSASAPSIPISNVVHHTVRLFGYHIHKTKLLKSPFEKSEPVKLEFRDPGLKSYDATPN
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query268 2.2.26 [Sep-21-2011]
Q03347 451 Runt-related transcriptio yes N/A 0.358 0.212 0.729 1e-38
Q01196 453 Runt-related transcriptio yes N/A 0.358 0.211 0.729 1e-38
Q63046 450 Runt-related transcriptio yes N/A 0.358 0.213 0.729 1e-38
Q64131 409 Runt-related transcriptio no N/A 0.406 0.266 0.627 1e-37
Q13761 415 Runt-related transcriptio no N/A 0.406 0.262 0.618 4e-37
Q13950 521 Runt-related transcriptio no N/A 0.350 0.180 0.723 4e-37
Q08775 607 Runt-related transcriptio no N/A 0.350 0.154 0.723 6e-37
Q6PF39 462 Runt-related transcriptio N/A N/A 0.354 0.205 0.705 9e-37
P22814 510 Segmentation protein Runt no N/A 0.347 0.182 0.709 1e-36
Q9W349 826 Protein lozenge OS=Drosop no N/A 0.354 0.115 0.715 3e-34
>sp|Q03347|RUNX1_MOUSE Runt-related transcription factor 1 OS=Mus musculus GN=Runx1 PE=1 SV=1 Back     alignment and function desciption
 Score =  160 bits (404), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 70/96 (72%), Positives = 82/96 (85%)

Query: 21  AERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIR 80
            +R++  +LA+HPGEL RT  P+ +C+VLPTHWR NKTLP+AFKVVALGDV DGT+VT+ 
Sbjct: 47  GDRSMVEVLADHPGELVRTDSPNFLCSVLPTHWRCNKTLPIAFKVVALGDVPDGTLVTVM 106

Query: 81  AGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116
           AGNDENY  ELRN TA MKNQVA+FNDLRFVGRSGR
Sbjct: 107 AGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGR 142




CBF binds to the core site, 5'-PYGPYGGT-3', of a number of enhancers and promoters, including murine leukemia virus, polyomavirus enhancer, T-cell receptor enhancers, LCK, IL-3 and GM-CSF promoters. Essential for the development of normal hematopoiesis. Isoform 4 shows higher binding activities for target genes and binds TCR-beta-E2 and RAG-1 target site with threefold higher affinity than other isoforms. It is less effective in the context of neutrophil terminal differentiation. Acts synergistically with ELF4 to transactivate the IL-3 promoter and with ELF2 to transactivate the BLK promoter. Inhibits KAT6B-dependent transcriptional activation.
Mus musculus (taxid: 10090)
>sp|Q01196|RUNX1_HUMAN Runt-related transcription factor 1 OS=Homo sapiens GN=RUNX1 PE=1 SV=3 Back     alignment and function description
>sp|Q63046|RUNX1_RAT Runt-related transcription factor 1 OS=Rattus norvegicus GN=Runx1 PE=2 SV=1 Back     alignment and function description
>sp|Q64131|RUNX3_MOUSE Runt-related transcription factor 3 OS=Mus musculus GN=Runx3 PE=2 SV=3 Back     alignment and function description
>sp|Q13761|RUNX3_HUMAN Runt-related transcription factor 3 OS=Homo sapiens GN=RUNX3 PE=1 SV=2 Back     alignment and function description
>sp|Q13950|RUNX2_HUMAN Runt-related transcription factor 2 OS=Homo sapiens GN=RUNX2 PE=1 SV=2 Back     alignment and function description
>sp|Q08775|RUNX2_MOUSE Runt-related transcription factor 2 OS=Mus musculus GN=Runx2 PE=1 SV=2 Back     alignment and function description
>sp|Q6PF39|RUNX1_XENLA Runt-related transcription factor 1 OS=Xenopus laevis GN=runx1 PE=2 SV=2 Back     alignment and function description
>sp|P22814|RUNT_DROME Segmentation protein Runt OS=Drosophila melanogaster GN=run PE=2 SV=2 Back     alignment and function description
>sp|Q9W349|LOZEN_DROME Protein lozenge OS=Drosophila melanogaster GN=lz PE=2 SV=2 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query268
328713666194 PREDICTED: protein lozenge-like [Acyrtho 0.429 0.592 0.844 7e-51
91089023 525 PREDICTED: similar to CG15455 CG15455-PA 0.421 0.215 0.814 4e-48
195567885 657 GD17495 [Drosophila simulans] gi|1942048 0.399 0.162 0.822 2e-47
221500120 663 RunxB, isoform A [Drosophila melanogaste 0.399 0.161 0.822 2e-47
195482114 699 GE17890 [Drosophila yakuba] gi|194189441 0.399 0.153 0.822 2e-47
194897264 671 GG19691 [Drosophila erecta] gi|190650271 0.399 0.159 0.822 2e-47
242013873216 runx1, putative [Pediculus humanus corpo 0.380 0.472 0.862 3e-47
194770329 684 GF15967 [Drosophila ananassae] gi|190614 0.399 0.156 0.822 3e-47
195346011293 GM23044 [Drosophila sechellia] gi|194134 0.399 0.365 0.822 4e-47
198469482301 GA13742 [Drosophila pseudoobscura pseudo 0.399 0.355 0.822 4e-47
>gi|328713666|ref|XP_001948981.2| PREDICTED: protein lozenge-like [Acyrthosiphon pisum] Back     alignment and taxonomy information
 Score =  206 bits (525), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 98/116 (84%), Positives = 106/116 (91%), Gaps = 1/116 (0%)

Query: 1   MHLTTENMNDNENFISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLP 60
           MHL+ E M + + F+ A+VLAERTLDSLLAEHPGEL RTGCPHVVCTVLPTHWRSNKTLP
Sbjct: 1   MHLS-EPMRNRDGFLKADVLAERTLDSLLAEHPGELVRTGCPHVVCTVLPTHWRSNKTLP 59

Query: 61  VAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116
           VAFKVVALG+V DGT VTIRAGNDEN+C ELRNCTA+MKNQVAKFNDLRFVGRSGR
Sbjct: 60  VAFKVVALGEVPDGTPVTIRAGNDENFCAELRNCTALMKNQVAKFNDLRFVGRSGR 115




Source: Acyrthosiphon pisum

Species: Acyrthosiphon pisum

Genus: Acyrthosiphon

Family: Aphididae

Order: Hemiptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|91089023|ref|XP_968913.1| PREDICTED: similar to CG15455 CG15455-PA [Tribolium castaneum] gi|270012808|gb|EFA09256.1| runt-like protein [Tribolium castaneum] Back     alignment and taxonomy information
>gi|195567885|ref|XP_002107488.1| GD17495 [Drosophila simulans] gi|194204896|gb|EDX18472.1| GD17495 [Drosophila simulans] Back     alignment and taxonomy information
>gi|221500120|ref|NP_608399.2| RunxB, isoform A [Drosophila melanogaster] gi|442617081|ref|NP_001259745.1| RunxB, isoform C [Drosophila melanogaster] gi|220901834|gb|AAF50893.2| RunxB, isoform A [Drosophila melanogaster] gi|440216982|gb|AGB95584.1| RunxB, isoform C [Drosophila melanogaster] Back     alignment and taxonomy information
>gi|195482114|ref|XP_002101917.1| GE17890 [Drosophila yakuba] gi|194189441|gb|EDX03025.1| GE17890 [Drosophila yakuba] Back     alignment and taxonomy information
>gi|194897264|ref|XP_001978622.1| GG19691 [Drosophila erecta] gi|190650271|gb|EDV47549.1| GG19691 [Drosophila erecta] Back     alignment and taxonomy information
>gi|242013873|ref|XP_002427625.1| runx1, putative [Pediculus humanus corporis] gi|212512040|gb|EEB14887.1| runx1, putative [Pediculus humanus corporis] Back     alignment and taxonomy information
>gi|194770329|ref|XP_001967246.1| GF15967 [Drosophila ananassae] gi|190614522|gb|EDV30046.1| GF15967 [Drosophila ananassae] Back     alignment and taxonomy information
>gi|195346011|ref|XP_002039562.1| GM23044 [Drosophila sechellia] gi|194134788|gb|EDW56304.1| GM23044 [Drosophila sechellia] Back     alignment and taxonomy information
>gi|198469482|ref|XP_001355038.2| GA13742 [Drosophila pseudoobscura pseudoobscura] gi|198146891|gb|EAL32094.2| GA13742 [Drosophila pseudoobscura pseudoobscura] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query268
FB|FBgn0259162 663 RunxB [Drosophila melanogaster 0.399 0.161 0.822 1.6e-45
FB|FBgn0083981 658 RunxA [Drosophila melanogaster 0.406 0.165 0.761 2.9e-42
MGI|MGI:102672 409 Runx3 "runt related transcript 0.350 0.229 0.734 6.4e-37
UNIPROTKB|Q13761 415 RUNX3 "Runt-related transcript 0.350 0.226 0.734 1.3e-36
RGD|620082 409 Runx3 "runt-related transcript 0.350 0.229 0.734 1.7e-36
UNIPROTKB|E9PU46 409 Runx3 "Protein Runx3" [Rattus 0.350 0.229 0.734 1.7e-36
UNIPROTKB|Q90813 408 Gga.45609 "Ch-runtB2" [Gallus 0.358 0.235 0.739 5.7e-36
UNIPROTKB|E1BAD4 453 RUNX1 "Uncharacterized protein 0.354 0.209 0.736 1.5e-35
UNIPROTKB|E2RJJ8 479 RUNX1 "Uncharacterized protein 0.354 0.198 0.736 1.5e-35
UNIPROTKB|E2RJK5 467 RUNX1 "Uncharacterized protein 0.354 0.203 0.736 1.5e-35
FB|FBgn0259162 RunxB [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
 Score = 478 (173.3 bits), Expect = 1.6e-45, P = 1.6e-45
 Identities = 88/107 (82%), Positives = 97/107 (90%)

Query:    10 DNENFISAEVLAERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALG 69
             D   +++ E L ERT+D LLAEHPGEL +TG PHVVCT LPTHWRSNKTLP+AFKV+ALG
Sbjct:    96 DTSPYLTPENLIERTVDVLLAEHPGELVKTGSPHVVCTTLPTHWRSNKTLPIAFKVLALG 155

Query:    70 DVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116
             +VMDGTIVTIRAGNDEN+CGELRNCTAVMKNQVAKFNDLRFVGRSGR
Sbjct:   156 EVMDGTIVTIRAGNDENFCGELRNCTAVMKNQVAKFNDLRFVGRSGR 202




GO:0006355 "regulation of transcription, DNA-dependent" evidence=ISS
GO:0005634 "nucleus" evidence=IEA;IC;NAS
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=ISS;NAS
GO:0003677 "DNA binding" evidence=IEA
GO:0005524 "ATP binding" evidence=IEA
GO:0009987 "cellular process" evidence=IMP
FB|FBgn0083981 RunxA [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
MGI|MGI:102672 Runx3 "runt related transcription factor 3" [Mus musculus (taxid:10090)] Back     alignment and assigned GO terms
UNIPROTKB|Q13761 RUNX3 "Runt-related transcription factor 3" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
RGD|620082 Runx3 "runt-related transcription factor 3" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|E9PU46 Runx3 "Protein Runx3" [Rattus norvegicus (taxid:10116)] Back     alignment and assigned GO terms
UNIPROTKB|Q90813 Gga.45609 "Ch-runtB2" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|E1BAD4 RUNX1 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E2RJJ8 RUNX1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms
UNIPROTKB|E2RJK5 RUNX1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query268
pfam00853134 pfam00853, Runt, Runt domain 2e-63
>gnl|CDD|109891 pfam00853, Runt, Runt domain Back     alignment and domain information
 Score =  195 bits (496), Expect = 2e-63
 Identities = 76/94 (80%), Positives = 86/94 (91%)

Query: 23  RTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAG 82
           R++  +LAEHPGEL +TG P+ +C+VLP+HWRSNKTLPVAFKVVALG+V DGT+VTIRAG
Sbjct: 1   RSVVEVLAEHPGELVKTGSPNFLCSVLPSHWRSNKTLPVAFKVVALGEVPDGTMVTIRAG 60

Query: 83  NDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116
           NDENYC ELRN TAVMKNQVAKFNDLRFVGRSGR
Sbjct: 61  NDENYCAELRNATAVMKNQVAKFNDLRFVGRSGR 94


Length = 134

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 268
PF00853135 Runt: Runt domain; InterPro: IPR013524 The AML1 ge 100.0
KOG3982|consensus 475 100.0
>PF00853 Runt: Runt domain; InterPro: IPR013524 The AML1 gene is rearranged by the t(8;21) translocation in acute myeloid leukemia [] Back     alignment and domain information
Probab=100.00  E-value=4.3e-73  Score=468.34  Aligned_cols=120  Identities=66%  Similarity=1.003  Sum_probs=105.9

Q ss_pred             hhhhhHHhhcCCCCccccCCCceEEeecCcccccCCCCCcceEEEEeeecCCCeEEEEeecCCCccccccccceeeeccc
Q psy3060          22 ERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQ  101 (268)
Q Consensus        22 er~l~~~l~eh~geLV~T~sP~~lCs~LP~HWRsNKtLP~aFkVvaLg~VpDGT~Vtv~AGNDEN~~AELRN~tAvMKNq  101 (268)
                      ||+|.|+|+|||||||+||||+|+||+||+||||||+||.+|||||||+|||||+|||+|||||||||||||++|+||||
T Consensus         1 er~~~~~l~~~~~~lv~T~sP~~lcs~LP~HWR~NKsLP~~FkVvalg~V~DGT~Vtv~AGNDEn~~aElRN~tavmknq   80 (135)
T PF00853_consen    1 ERTLHEALAEHPGELVRTDSPNFLCSVLPSHWRSNKSLPVAFKVVALGDVPDGTLVTVMAGNDENYCAELRNATAVMKNQ   80 (135)
T ss_dssp             ---HHHHHHHSTCCEEECSSTTEEEES-TSEEETTSS-SS-EEEEESSS-STTEEEEEEEEESSCSSBBEES-EEEEETT
T ss_pred             CccHHHHHHhCCcccccCCCCCEEeecccccccccCCCCCceeEEEEEEcCCCcEEEEEecCCchhhhhhhchhhhhhcc
Confidence            89999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             eeeecceeeeeccCCCceeeceEee-------eeeeeEEee----ecCccc
Q psy3060         102 VAKFNDLRFVGRSGREAEASPKDWV-------GTLIWLIGI----KNKHQQ  141 (268)
Q Consensus       102 VArFNDLRFVGRSGRGKsFtlti~v-------~t~~~~i~v----~~~hrq  141 (268)
                      |||||||||||||||||||+|||+|       +||++||||    ||++|.
T Consensus        81 vA~FnDLRFvGRSGRGKsFtltItv~t~PpqvAty~~AIKVTVDGPRePR~  131 (135)
T PF00853_consen   81 VARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKVTVDGPREPRS  131 (135)
T ss_dssp             EEEESS-EECST-TTTSEEEEEEEE-SSS-EEEEECCEEEEESS-SSCCG-
T ss_pred             cccccccccccccCCccceEEEEEEeCCCchHHhheeeEEEEecCCCCCcc
Confidence            9999999999999999999999996       889999999    555554



The gene is highly similar to the Drosophila melanogaster segmentation gene runt and to the mouse transcription factor PEBP2 alpha subunit gene []. The region of shared similarity, known as the Runt domain, is responsible for DNA-binding and protein-protein interaction. In addition to the highly-conserved Runt domain, the AML-1 gene product carries a putative ATP-binding site (GRSGRGKS), and has a C-terminal region rich in proline and serine residues. The protein (known as acute myeloid leukemia 1 protein, oncogene AML-1, core-binding factor (CBF), alpha-B subunit, etc.) binds to the core site, 5'-pygpyggt-3', of a number of enhancers and promoters. The protein is a heterodimer of alpha- and beta-subunits. The alpha-subunit binds DNA as a monomer, and appears to have a role in the development of normal hematopoiesis. CBF is a nuclear protein expressed in numerous tissue types, except brain and heart; highest levels have been found to occur in thymus, bone marrow and peripheral blood. This domain occurs towards the N terminus of the proteins in this entry.; GO: 0003677 DNA binding, 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1E50_E 1LJM_A 1CO1_A 1H9D_C 1CMO_A 1EAO_B 1HJC_D 1HJB_F 2J6W_B 1EAN_A ....

>KOG3982|consensus Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query268
2j6w_A140 R164n Mutant Of The Runx1 Runt Domain Length = 140 1e-38
1ean_A140 The Runx1 Runt Domain At 1.25a Resolution: A Struct 2e-38
1cmo_A127 Immunoglobulin Motif Dna-Recognition And Heterodime 2e-38
1h9d_A134 Aml1CBF-BetaDNA COMPLEX Length = 134 1e-37
1ljm_A131 Dna Recognition Is Mediated By Conformational Trans 2e-37
1eaq_A140 The Runx1 Runt Domain At 1.25a Resolution: A Struct 2e-37
1io4_C123 Crystal Structure Of Runx-1AML1CBFALPHA RUNT DOMAIN 2e-34
1co1_A115 Fold Of The Cbfa Length = 115 9e-34
>pdb|2J6W|A Chain A, R164n Mutant Of The Runx1 Runt Domain Length = 140 Back     alignment and structure

Iteration: 1

Score = 156 bits (394), Expect = 1e-38, Method: Compositional matrix adjust. Identities = 70/96 (72%), Positives = 82/96 (85%) Query: 21 AERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIR 80 +R++ +LA+HPGEL RT P+ + +VLPTHWRSNKTLP+AFKVVALGDV DGT+VT+ Sbjct: 2 GDRSMVEVLADHPGELVRTDSPNFLSSVLPTHWRSNKTLPIAFKVVALGDVPDGTLVTVM 61 Query: 81 AGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116 AGNDENY ELRN TA MKNQVA+FNDLRFVGRSGR Sbjct: 62 AGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGR 97
>pdb|1EAN|A Chain A, The Runx1 Runt Domain At 1.25a Resolution: A Structural Switch And Specifically Bound Chloride Ions Modulate Dna Binding Length = 140 Back     alignment and structure
>pdb|1CMO|A Chain A, Immunoglobulin Motif Dna-Recognition And Heterodimerization For The Pebp2CBF RUNT-Domain Length = 127 Back     alignment and structure
>pdb|1H9D|A Chain A, Aml1CBF-BetaDNA COMPLEX Length = 134 Back     alignment and structure
>pdb|1LJM|A Chain A, Dna Recognition Is Mediated By Conformational Transition And By Dna Bending Length = 131 Back     alignment and structure
>pdb|1EAQ|A Chain A, The Runx1 Runt Domain At 1.25a Resolution: A Structural Switch And Specifically Bound Chloride Ions Modulate Dna Binding Length = 140 Back     alignment and structure
>pdb|1IO4|C Chain C, Crystal Structure Of Runx-1AML1CBFALPHA RUNT DOMAIN- Cbfbeta Core Domain Heterodimer And CEBPBETA BZIP Homodimer Bound To A Dna Fragment From The Csf-1r Promoter Length = 123 Back     alignment and structure
>pdb|1CO1|A Chain A, Fold Of The Cbfa Length = 115 Back     alignment and structure

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query268
1eaq_A140 RUNT-related transcription factor 1; transcription 2e-51
>1eaq_A RUNT-related transcription factor 1; transcription/DNA, acute myeloid leukemia, AML, RUNX1, RUNT domain, chloride binding, IG fold; HET: MSE; 1.25A {Mus musculus} SCOP: b.2.5.6 PDB: 1ean_A 1eao_A* 2j6w_A 1e50_A 1h9d_A* 1ljm_A 1cmo_A 1hjc_A* 1hjb_C* 1io4_C 1co1_A Length = 140 Back     alignment and structure
 Score =  163 bits (414), Expect = 2e-51
 Identities = 70/96 (72%), Positives = 82/96 (85%)

Query: 21  AERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIR 80
            +R++  +LA+HPGEL RT  P+ + +VLPTHWRSNKTLP+AFKVVALGDV DGT+VT+ 
Sbjct: 2   GDRSMVEVLADHPGELVRTDSPNFLSSVLPTHWRSNKTLPIAFKVVALGDVPDGTLVTVM 61

Query: 81  AGNDENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116
           AGNDENY  ELRN TA MKNQVA+FNDLRFVGRSGR
Sbjct: 62  AGNDENYSAELRNATAAMKNQVARFNDLRFVGRSGR 97


Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query268
1eaq_A140 RUNT-related transcription factor 1; transcription 100.0
>1eaq_A RUNT-related transcription factor 1; transcription/DNA, acute myeloid leukemia, AML, RUNX1, RUNT domain, chloride binding, IG fold; HET: MSE; 1.25A {Mus musculus} SCOP: b.2.5.6 PDB: 1ean_A 1eao_A* 2j6w_A 1e50_A 1h9d_A* 1ljm_A 1cmo_A 1hjc_A* 1hjb_C* 1io4_C 1co1_A Back     alignment and structure
Probab=100.00  E-value=8.7e-73  Score=467.17  Aligned_cols=124  Identities=61%  Similarity=0.904  Sum_probs=110.9

Q ss_pred             hhhhhhHHhhcCCCCccccCCCceEEeecCcccccCCCCCcceEEEEeeecCCCeEEEEeecCCCccccccccceeeecc
Q psy3060          21 AERTLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKN  100 (268)
Q Consensus        21 ~er~l~~~l~eh~geLV~T~sP~~lCs~LP~HWRsNKtLP~aFkVvaLg~VpDGT~Vtv~AGNDEN~~AELRN~tAvMKN  100 (268)
                      +||++.|+|+|||||||+||||||+||+||+||||||+||++|||||||+|||||+|||+|||||||||||||++|+|||
T Consensus         2 ~er~~~e~l~~~~gelv~T~sP~~~cs~LP~HWRsNKtLP~~FkVvalg~V~DGT~Vtv~AGNDEn~~aelrN~~a~mkn   81 (140)
T 1eaq_A            2 GDRSMVEVLADHPGELVRTDSPNFLSSVLPTHWRSNKTLPIAFKVVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKN   81 (140)
T ss_dssp             ---CHHHHHHHSTTTCEECSSTTEEECCCCSEEETTCCCSSCCEEEESSCCCTTCEEEEEEEETTEEEECEESCEEEEET
T ss_pred             cchhHHHHHhhCCCceEecCCCCEEEccCcchhccCCCCCCceEEEEecccCCCCEEEEEecCCcchhHHHHhHHHHHhh
Confidence            79999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ceeeecceeeeeccCCCceeeceEee-------eeeeeEEee-------ecCcccchh
Q psy3060         101 QVAKFNDLRFVGRSGREAEASPKDWV-------GTLIWLIGI-------KNKHQQYEI  144 (268)
Q Consensus       101 qVArFNDLRFVGRSGRGKsFtlti~v-------~t~~~~i~v-------~~~hrq~~~  144 (268)
                      ||||||||||||||||||||+|||+|       +||++||||       ||+|||+.+
T Consensus        82 qvA~FnDLRFvgRSGRGKsFtlTItv~t~PpqvAty~~AIKVTVDGPRePR~~rqk~~  139 (140)
T 1eaq_A           82 QVARFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKITVDGPREPRRHRQKLD  139 (140)
T ss_dssp             TEEECSSCEECSCCCTTCCBEEEEEECSSSCEEEECSSCBEEESSCC-----------
T ss_pred             ccceeecccccccCCCCccEEEEEEEecCCcchhhheeeEEEEecCCCCCcccccccC
Confidence            99999999999999999999999996       889999999       688888654




Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 268
d1eaqa_124 b.2.5.6 (A:) Acute myeloid leukemia 1 protein (AML 7e-60
>d1eaqa_ b.2.5.6 (A:) Acute myeloid leukemia 1 protein (AML1), RUNT domain {Mouse (Mus musculus) [TaxId: 10090]} Length = 124 Back     information, alignment and structure

class: All beta proteins
fold: Common fold of diphtheria toxin/transcription factors/cytochrome f
superfamily: p53-like transcription factors
family: RUNT domain
domain: Acute myeloid leukemia 1 protein (AML1), RUNT domain
species: Mouse (Mus musculus) [TaxId: 10090]
 Score =  183 bits (467), Expect = 7e-60
 Identities = 69/93 (74%), Positives = 80/93 (86%)

Query: 24  TLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGN 83
           ++  +LA+HPGEL RT  P+ + +VLPTHWRSNKTLP+AFKVVALGDV DGT+VT+ AGN
Sbjct: 1   SMVEVLADHPGELVRTDSPNFLSSVLPTHWRSNKTLPIAFKVVALGDVPDGTLVTVMAGN 60

Query: 84  DENYCGELRNCTAVMKNQVAKFNDLRFVGRSGR 116
           DENY  ELRN TA MKNQVA+FNDLRFVGRSGR
Sbjct: 61  DENYSAELRNATAAMKNQVARFNDLRFVGRSGR 93


Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query268
d1eaqa_124 Acute myeloid leukemia 1 protein (AML1), RUNT doma 100.0
>d1eaqa_ b.2.5.6 (A:) Acute myeloid leukemia 1 protein (AML1), RUNT domain {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: All beta proteins
fold: Common fold of diphtheria toxin/transcription factors/cytochrome f
superfamily: p53-like transcription factors
family: RUNT domain
domain: Acute myeloid leukemia 1 protein (AML1), RUNT domain
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=100.00  E-value=1.1e-69  Score=439.79  Aligned_cols=113  Identities=65%  Similarity=0.957  Sum_probs=108.2

Q ss_pred             hhhHHhhcCCCCccccCCCceEEeecCcccccCCCCCcceEEEEeeecCCCeEEEEeecCCCccccccccceeeecccee
Q psy3060          24 TLDSLLAEHPGELTRTGCPHVVCTVLPTHWRSNKTLPVAFKVVALGDVMDGTIVTIRAGNDENYCGELRNCTAVMKNQVA  103 (268)
Q Consensus        24 ~l~~~l~eh~geLV~T~sP~~lCs~LP~HWRsNKtLP~aFkVvaLg~VpDGT~Vtv~AGNDEN~~AELRN~tAvMKNqVA  103 (268)
                      +|.|+|+|||||||+|+||+|+||+||+|||||||||.+||||+||+|||||+|||+|||||||||||||++|+||||||
T Consensus         1 ~l~~~l~e~pgelv~t~sP~~lCs~LP~HWRsNKtLp~~Fkvv~L~~v~dGt~vti~agNden~~aelrn~ta~mknqva   80 (124)
T d1eaqa_           1 SMVEVLADHPGELVRTDSPNFLSSVLPTHWRSNKTLPIAFKVVALGDVPDGTLVTVMAGNDENYSAELRNATAAMKNQVA   80 (124)
T ss_dssp             CHHHHHHHSTTTCEECSSTTEEECCCCSEEETTCCCSSCCEEEESSCCCTTCEEEEEEEETTEEEECEESCEEEEETTEE
T ss_pred             CHhHHHHhCCCccccCCCCCEEeecCcchhccCCccCcceEEEEeeecCCCcEEEEEecCCcccchhhhhhHHHHhhccc
Confidence            47899999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             eecceeeeeccCCCceeeceEee-------eeeeeEEeee
Q psy3060         104 KFNDLRFVGRSGREAEASPKDWV-------GTLIWLIGIK  136 (268)
Q Consensus       104 rFNDLRFVGRSGRGKsFtlti~v-------~t~~~~i~v~  136 (268)
                      |||||||||||||||||||||+|       +||++||||.
T Consensus        81 ~fndLrFvgRSgrgKsFtLTItv~t~PpqVAty~rAIKVT  120 (124)
T d1eaqa_          81 RFNDLRFVGRSGRGKSFTLTITVFTNPPQVATYHRAIKIT  120 (124)
T ss_dssp             ECSSCEECSCCCTTCCBEEEEEECSSSCEEEECSSCBEEE
T ss_pred             cccceEEeccCCCCceEEEEEEEecCCchheeheeeEEEe
Confidence            99999999999999999999997       5667777663