Psyllid ID: psy2138


Local Sequence Feature Prediction

Prediction and (Method)Result
Residue Number Marker
Protein Sequence ?
Secondary Structure (PSIPRED) ?
Secondary Structure Prediction (SSPRO) ?
Coil and Loop (DISEMBL) ?
Flexible Loop (DISEMBL) ?
Low Complexity Region (SEG) ?
Disordered region (IsUnstruct) ?
Disordered Region (DISOPRED) ?
Disordered Region (DISEMBL) ?
Disordered Region (DISPRO) ?
Transmembrane Helix (TMHMM) ?
Transmembrane Helix (HMMTOP) ?
Transmembrane Helix (MEMSAT) ?
TM Helix, Signal Peptide (MEMSAT_SVM) ?
TM Helix, Signal Peptide (Phobius) ?
Signal Peptide (SignalP HMM Mode) ?
Signal Peptide (SignalP NN Mode) ?
Coiled Coils (COILS) ?
Positional Conservation ?
 
--------10--------20--------30--------40--------50--------60--------70--------80--------90-------100-------110-------120-------130-------140-------150-------160-------170-------180-------190-------200-------21
MAEAEVDMAEYNPHDLFFHDVGPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIPGKLSDGELDYLKPNKKSGSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLATGHGLVARPGSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYIESFP
cccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHcccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHHHHHHHHHccccccccccccccccccccEEEEEccHHHHHHHHHHHHcccccHHcccccccccHHHHHHHHHHHHccc
ccHHHHHHHHccccccEcccccccccccccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHccccccccccccccccccccccccccccccccccccccccccccccccccccccccccHHHHcHHHcccccccccccccccccccHHHEEEccHHHHHHHHHHHHHHHcHHHHHccccccccHHHHHHHHHHHHccc
MAEAEVdmaeynphdlffhdvgpgmvngangdvlisnkpcescnvtvstnwfpwysgtkicINCHAFWRkygglkipgklsdgeldylkpnkksgssslseedrigpsglvghrphrcsvpgcskefKLKAHLHRHLATghglvarpgsprpimktrnafylratplTRISRRLCRhviqprhaarspfwcINIAAVKTDCEYIESFP
MAEAEVDMAEYNPHDLFFHDVGPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIPGKLSDGELDYLKpnkksgssslseedRIGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLATghglvarpgsprpiMKTRNAFYLRATPLTRISRRLCRHVIqprhaarspfwciNIAAVKTDCEYIESFP
MAEAEVDMAEYNPHDLFFHDVGPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIPGKLSDGELDYLKPNKKSGSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLATGHGLVARPGSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYIESFP
**********YNPHDLFFHDVGPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIPGKL*************************************CSVPGCSKEFKLKAHLHRHLATGHGLVARP***RPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYI****
***AEVDMAEYNPHDL***********************CESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIP********************************************************************************NAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYI****
********AEYNPHDLFFHDVGPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIPGKLSDGELDYLKPN*************IGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLATGHGLVARPGSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYIESFP
****EVDMAEYNPHDLFFH**G*****************CESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIP*********************************************C*KEFKL********ATGHGLVARPGSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYIES**
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
ooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooohhhhhhhhhhhhhhhhiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
oooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooooo
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
MAEAEVDMAEYNPHDLFFHDVGPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSGTKICINCHAFWRKYGGLKIPGKLSDGELDYLKPNKKSGSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLATGHGLVARPGSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYIESFP
no confident homologs detected

Close Homologs for Annotation Transfer

Close Homologs in SWISS-PROT Database Detected by BLAST ?

ID ?Alignment graph ?Length ? Definition ? RBH(Q2H) ? RBH(H2Q) ? Q cover ? H cover ? Identity ? E-value ?
Query208 2.2.26 [Sep-21-2011]
Q13330 715 Metastasis-associated pro yes N/A 0.692 0.201 0.372 2e-24
Q62599 703 Metastasis-associated pro yes N/A 0.692 0.204 0.357 3e-23
Q8K4B0 715 Metastasis-associated pro yes N/A 0.692 0.201 0.357 8e-23
Q9BTC8594 Metastasis-associated pro no N/A 0.745 0.260 0.304 8e-21
A6QL72590 Metastasis-associated pro yes N/A 0.793 0.279 0.294 3e-20
Q924K8591 Metastasis-associated pro no N/A 0.788 0.277 0.308 1e-18
Q9R190 668 Metastasis-associated pro no N/A 0.721 0.224 0.290 2e-17
O94776 668 Metastasis-associated pro no N/A 0.721 0.224 0.290 2e-17
>sp|Q13330|MTA1_HUMAN Metastasis-associated protein MTA1 OS=Homo sapiens GN=MTA1 PE=1 SV=2 Back     alignment and function desciption
 Score =  112 bits (280), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 73/196 (37%), Positives = 98/196 (50%), Gaps = 52/196 (26%)

Query: 12  NPHDLFFHDVGPGMVNG--ANGDVLISNKPCESCNVTVSTNWFPW---YSGTKICINCHA 66
           NP+ +  ++V  G+VNG  A G    + + CESC  T S  W+ W       ++C +C  
Sbjct: 363 NPNQISVNNVKAGVVNGTGAPGQSPGAGRACESCYTTQSYQWYSWGPPNMQCRLCASCWT 422

Query: 67  FWRKYGGLKIPGKLSDGELDYLKPNKKSGSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKE 126
           +W+KYGGLK+P +L DGE                   R GP                   
Sbjct: 423 YWKKYGGLKMPTRL-DGE-------------------RPGP------------------- 443

Query: 127 FKLKAHLHRHLATGHGLVARP-GSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAA 185
                  +R   + HGL AR  GSP+  MKTR AFYL  T LTRI+RRLCR +++P HAA
Sbjct: 444 -------NRSNMSPHGLPARSSGSPKFAMKTRQAFYLHTTKLTRIARRLCREILRPWHAA 496

Query: 186 RSPFWCINIAAVKTDC 201
           R P+  IN AA+K +C
Sbjct: 497 RHPYLPINSAAIKAEC 512




May be involved in the regulation of gene expression by covalent modification of histone proteins. Isoform Long is a corepressor of estrogen receptor (ER). Isoform Short binds to ER and sequesters it in the cytoplasm and enhances non-genomic responses of ER.
Homo sapiens (taxid: 9606)
>sp|Q62599|MTA1_RAT Metastasis-associated protein MTA1 OS=Rattus norvegicus GN=Mta1 PE=1 SV=1 Back     alignment and function description
>sp|Q8K4B0|MTA1_MOUSE Metastasis-associated protein MTA1 OS=Mus musculus GN=Mta1 PE=1 SV=1 Back     alignment and function description
>sp|Q9BTC8|MTA3_HUMAN Metastasis-associated protein MTA3 OS=Homo sapiens GN=MTA3 PE=1 SV=2 Back     alignment and function description
>sp|A6QL72|MTA3_BOVIN Metastasis-associated protein MTA3 OS=Bos taurus GN=MTA3 PE=2 SV=1 Back     alignment and function description
>sp|Q924K8|MTA3_MOUSE Metastasis-associated protein MTA3 OS=Mus musculus GN=Mta3 PE=1 SV=1 Back     alignment and function description
>sp|Q9R190|MTA2_MOUSE Metastasis-associated protein MTA2 OS=Mus musculus GN=Mta2 PE=1 SV=1 Back     alignment and function description
>sp|O94776|MTA2_HUMAN Metastasis-associated protein MTA2 OS=Homo sapiens GN=MTA2 PE=1 SV=1 Back     alignment and function description

Close Homologs in the Non-Redundant Database Detected by BLAST ?

GI ?Alignment Graph ?Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query208
242008559 694 conserved hypothetical protein [Pediculu 0.831 0.249 0.590 1e-54
189239491 761 PREDICTED: similar to MTA1-like CG2244-P 0.884 0.241 0.548 5e-52
405950922 723 Metastasis-associated protein MTA1 [Cras 0.889 0.255 0.478 7e-43
347971738 1091 AGAP004341-PA [Anopheles gambiae str. PE 0.879 0.167 0.424 2e-38
307208573 962 Metastasis-associated protein MTA1 [Harp 0.846 0.182 0.444 3e-38
312385022 801 hypothetical protein AND_01284 [Anophele 0.879 0.228 0.410 3e-38
195502050 880 GE24151 [Drosophila yakuba] gi|194184155 0.855 0.202 0.400 2e-34
170038987 959 metastasis-associated protein 3 [Culex q 0.860 0.186 0.389 3e-34
307169394 910 Metastasis-associated protein MTA1 [Camp 0.894 0.204 0.399 4e-34
194741148 875 GF17406 [Drosophila ananassae] gi|190626 0.826 0.196 0.408 4e-33
>gi|242008559|ref|XP_002425070.1| conserved hypothetical protein [Pediculus humanus corporis] gi|212508735|gb|EEB12332.1| conserved hypothetical protein [Pediculus humanus corporis] Back     alignment and taxonomy information
 Score =  218 bits (555), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 108/183 (59%), Positives = 135/183 (73%), Gaps = 10/183 (5%)

Query: 24  GMVNGAN-GDVLISNKPCESCNVTVSTNWFPW---YSGTKICINCHAFWRKYGGLKIPGK 79
           G++NG N GD LI  K CESC  T S+ W+ W   ++  K+C +C  +W+KYGGLK P +
Sbjct: 287 GVLNGNNHGDGLICGKSCESCQGTSSSQWYAWGSSHTQYKLCHSCWIYWKKYGGLKNPSR 346

Query: 80  LSDGELDYLKPNKKSGSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLAT 139
           L++ E D     KKSGS  +S+E++I PS +  HRPHRC+V  C KEFKLKAHL RH AT
Sbjct: 347 LNESESDL---KKKSGS--ISDEEKI-PSSIASHRPHRCTVNNCGKEFKLKAHLARHYAT 400

Query: 140 GHGLVARPGSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKT 199
            HG+V R GSPRPIMKTR AFYL  TPLTR+SRRLCRH++QPRHAARSPFW IN+ A+K 
Sbjct: 401 AHGVVIRSGSPRPIMKTRTAFYLHTTPLTRLSRRLCRHIMQPRHAARSPFWPINVPAIKQ 460

Query: 200 DCE 202
           +C+
Sbjct: 461 ECQ 463




Source: Pediculus humanus corporis

Species: Pediculus humanus

Genus: Pediculus

Family: Pediculidae

Order: Phthiraptera

Class: Insecta

Phylum: Arthropoda

Superkingdom: Eukaryota

>gi|189239491|ref|XP_975498.2| PREDICTED: similar to MTA1-like CG2244-PB [Tribolium castaneum] gi|270009558|gb|EFA06006.1| hypothetical protein TcasGA2_TC008832 [Tribolium castaneum] Back     alignment and taxonomy information
>gi|405950922|gb|EKC18877.1| Metastasis-associated protein MTA1 [Crassostrea gigas] Back     alignment and taxonomy information
>gi|347971738|ref|XP_313620.5| AGAP004341-PA [Anopheles gambiae str. PEST] gi|333469006|gb|EAA09201.5| AGAP004341-PA [Anopheles gambiae str. PEST] Back     alignment and taxonomy information
>gi|307208573|gb|EFN85907.1| Metastasis-associated protein MTA1 [Harpegnathos saltator] Back     alignment and taxonomy information
>gi|312385022|gb|EFR29616.1| hypothetical protein AND_01284 [Anopheles darlingi] Back     alignment and taxonomy information
>gi|195502050|ref|XP_002098054.1| GE24151 [Drosophila yakuba] gi|194184155|gb|EDW97766.1| GE24151 [Drosophila yakuba] Back     alignment and taxonomy information
>gi|170038987|ref|XP_001847328.1| metastasis-associated protein 3 [Culex quinquefasciatus] gi|167862606|gb|EDS25989.1| metastasis-associated protein 3 [Culex quinquefasciatus] Back     alignment and taxonomy information
>gi|307169394|gb|EFN62114.1| Metastasis-associated protein MTA1 [Camponotus floridanus] Back     alignment and taxonomy information
>gi|194741148|ref|XP_001953051.1| GF17406 [Drosophila ananassae] gi|190626110|gb|EDV41634.1| GF17406 [Drosophila ananassae] Back     alignment and taxonomy information

Prediction of Gene Ontology (GO) Terms

Close Homologs with Gene Ontology terms Detected by BLAST ?

ID ? Alignment graph ? Length ? Definition ? Q cover ? H cover ? Identity ? E-value ?
Query208
UNIPROTKB|G3MY42 511 G3MY42 "Uncharacterized protei 0.293 0.119 0.596 3.7e-28
UNIPROTKB|H0Y4T7 511 MTA1 "Metastasis-associated pr 0.293 0.119 0.612 4.7e-28
UNIPROTKB|G3MXY7 713 G3MXY7 "Uncharacterized protei 0.293 0.085 0.596 1.2e-27
UNIPROTKB|E1C4P3521 MTA1 "Uncharacterized protein" 0.326 0.130 0.536 1.4e-27
UNIPROTKB|E7ESY4 703 MTA1 "Metastasis-associated pr 0.293 0.086 0.612 1.4e-27
UNIPROTKB|Q13330 715 MTA1 "Metastasis-associated pr 0.293 0.085 0.612 1.5e-27
FB|FBgn0027951 880 MTA1-like "MTA1-like" [Drosoph 0.692 0.163 0.395 2e-27
UNIPROTKB|F1N9W5 624 MTA1 "Uncharacterized protein" 0.326 0.108 0.536 2.7e-27
UNIPROTKB|E1C4P2 697 MTA1 "Uncharacterized protein" 0.326 0.097 0.536 3.8e-27
UNIPROTKB|F1PP42 713 MTA1 "Uncharacterized protein" 0.293 0.085 0.596 4e-27
UNIPROTKB|G3MY42 G3MY42 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
 Score = 190 (71.9 bits), Expect = 3.7e-28, Sum P(2) = 3.7e-28
 Identities = 37/62 (59%), Positives = 46/62 (74%)

Query:   141 HGLVARP-GSPRPIMKTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKT 199
             HG+ AR  GSP+  MKTR AFYL  T LTRI+RRLCR +++P HAAR P+  IN AA+K 
Sbjct:   243 HGIPARSSGSPKFAMKTRQAFYLHTTKLTRIARRLCREILRPWHAARHPYMPINSAAIKA 302

Query:   200 DC 201
             +C
Sbjct:   303 EC 304


GO:0005634 "nucleus" evidence=IEA
GO:0043565 "sequence-specific DNA binding" evidence=IEA
GO:0008270 "zinc ion binding" evidence=IEA
GO:0003700 "sequence-specific DNA binding transcription factor activity" evidence=IEA
GO:0003682 "chromatin binding" evidence=IEA
UNIPROTKB|H0Y4T7 MTA1 "Metastasis-associated protein MTA1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|G3MXY7 G3MXY7 "Uncharacterized protein" [Bos taurus (taxid:9913)] Back     alignment and assigned GO terms
UNIPROTKB|E1C4P3 MTA1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|E7ESY4 MTA1 "Metastasis-associated protein MTA1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
UNIPROTKB|Q13330 MTA1 "Metastasis-associated protein MTA1" [Homo sapiens (taxid:9606)] Back     alignment and assigned GO terms
FB|FBgn0027951 MTA1-like "MTA1-like" [Drosophila melanogaster (taxid:7227)] Back     alignment and assigned GO terms
UNIPROTKB|F1N9W5 MTA1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|E1C4P2 MTA1 "Uncharacterized protein" [Gallus gallus (taxid:9031)] Back     alignment and assigned GO terms
UNIPROTKB|F1PP42 MTA1 "Uncharacterized protein" [Canis lupus familiaris (taxid:9615)] Back     alignment and assigned GO terms

Prediction of Enzyme Commission (EC) Number

EC Number Prediction by Annotation Transfer from SWISS-PROT Entries ?

No confident hit for EC number transfering in SWISSPROT detected by BLAST

EC Number Prediction by EFICAz Software ?

No EC number assignment, probably not an enzyme!


Prediction of Functionally Associated Proteins


Conserved Domains and Related Protein Families

Conserved Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query208
smart0040152 smart00401, ZnF_GATA, zinc finger binding to DNA c 0.001
>gnl|CDD|214648 smart00401, ZnF_GATA, zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
 Score = 35.9 bits (83), Expect = 0.001
 Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 1/43 (2%)

Query: 36 SNKPCESCNVTVSTNWFPWYSGTK-ICINCHAFWRKYGGLKIP 77
          S + C +C  T +  W    SG K +C  C  +++K+GGLK P
Sbjct: 2  SGRSCSNCGTTETPLWRRGPSGNKTLCNACGLYYKKHGGLKRP 44


Length = 52

Conserved Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query 208
KOG3554|consensus 693 100.0
smart0040152 ZnF_GATA zinc finger binding to DNA consensus sequ 98.89
PF0032036 GATA: GATA zinc finger; InterPro: IPR000679 Zinc f 98.53
cd0020254 ZnF_GATA Zinc finger DNA binding domain; binds spe 98.47
>KOG3554|consensus Back     alignment and domain information
Probab=100.00  E-value=1.6e-62  Score=460.64  Aligned_cols=166  Identities=37%  Similarity=0.671  Sum_probs=144.5

Q ss_pred             ceeecCCCCCCCCCCccC---CCCcccCCCCCCCCCCCCCCCccccCcccceecCCC---cccccccchhhhccCCCCCC
Q psy2138           4 AEVDMAEYNPHDLFFHDV---GPGMVNGANGDVLISNKPCESCNVTVSTNWFPWYSG---TKICINCHAFWRKYGGLKIP   77 (208)
Q Consensus         4 kQVYIP~YnKPNPn~I~~---~~g~~NG~~g~~~~~g~~CEsC~t~~S~QWYsWGP~---cRLC~sCW~YWKKYGGLK~p   77 (208)
                      ||||||+|||||||||++   ++| |||++.. .+.|++||||||+||.|||+|||+   ||||++||+|||||||||||
T Consensus       352 kqvYIP~ynKPnpnqI~~~n~k~g-~~gtt~~-~~~g~~CEsC~ttqs~qWYsWGppnmqcrLCasCWiyWKKygGLk~p  429 (693)
T KOG3554|consen  352 KQVYIPTYNKPNPNQISPYNTKPG-MNGTTFQ-NQDGRACESCYTTQSLQWYSWGPPNMQCRLCASCWIYWKKYGGLKMP  429 (693)
T ss_pred             heeeccCCCCCCcceecccCCCcC-ccccccc-CCCCCcccccccccccceeccCCCCccchhhHHHHHHHHHhcCcCCc
Confidence            899999999999999987   555 7886422 245999999999999999999999   99999999999999999999


Q ss_pred             ccCCCCCCCccCCCCCCCCCCCcccccCCCCCCCCCCCCccCCCCCchhhhhhhhhcccccccCCcccC-CCC--CCCcc
Q psy2138          78 GKLSDGELDYLKPNKKSGSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKEFKLKAHLHRHLATGHGLVAR-PGS--PRPIM  154 (208)
Q Consensus        78 ~~~e~~~~~~~~~~~~s~~~~~~~~~~~~p~~~~~~~~~~~~i~~~~ke~~~r~hl~R~~~~~~g~~~r-~g~--p~~~~  154 (208)
                      |++|+++...                    +..++             +...+.|++|.  +.||++++ +|+  |+..+
T Consensus       430 tqle~~~~~~--------------------~p~~e-------------~p~~r~~~Sr~--~p~s~~a~~t~a~~~k~~~  474 (693)
T KOG3554|consen  430 TQLEGEDKGP--------------------PPPVE-------------PPAPRSHASRG--TPQSLPARATGAVNPKTAQ  474 (693)
T ss_pred             chhccccCCC--------------------CCCCC-------------CCCCCccccCC--CcccCCcccccccCcccHH
Confidence            9999887431                    00111             22347889987  78888888 666  89999


Q ss_pred             cccceEEEecChhhHHHHHhhhcccChhhhhcCCCcccCHHHHHhhhhhhcc
Q psy2138         155 KTRNAFYLRATPLTRISRRLCRHVIQPRHAARSPFWCINIAAVKTDCEYIES  206 (208)
Q Consensus       155 KTR~AF~L~Tt~~tRlARrlck~~~~~R~aARrPf~~IN~~aIk~ec~~r~~  206 (208)
                      ||||+|+|+||.|||||||+|+++||+|++|||||.+||.++||+||++|++
T Consensus       475 k~rQ~f~l~tt~lt~lar~~crnlL~~~~Aarrp~~~in~~aikae~a~r~~  526 (693)
T KOG3554|consen  475 KTRQAFALHTTKLTRLARRDCRNLLRPREAARRPYTPINFAAIKAECAQRLP  526 (693)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhcCHHHHhhCccchhhHHHHHHHHHHHHH
Confidence            9999999999999999999999999999999999999999999999998864



>smart00401 ZnF_GATA zinc finger binding to DNA consensus sequence [AT]GATA[AG] Back     alignment and domain information
>PF00320 GATA: GATA zinc finger; InterPro: IPR000679 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule Back     alignment and domain information
>cd00202 ZnF_GATA Zinc finger DNA binding domain; binds specifically to DNA consensus sequence [AT]GATA[AG] promoter elements; a subset of family members may also bind protein; zinc-finger consensus topology is C-X(2)-C-X(17)-C-X(2)-C Back     alignment and domain information

Homologous Structure Templates

Structure Templates Detected by BLAST ?

No homologous structure with e-value below 0.005

Structure Templates Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query208
2epa_A72 Krueppel-like factor 10; transforming growth facto 2e-04
1va1_A37 Transcription factor SP1; C2H2 type zinc finger, D 2e-04
>2epa_A Krueppel-like factor 10; transforming growth factor-beta-inducible early growth response protein 1, TGFB-inducible early growth response protein 1; NMR {Homo sapiens} Length = 72 Back     alignment and structure
 Score = 37.8 bits (88), Expect = 2e-04
 Identities = 17/46 (36%), Positives = 18/46 (39%), Gaps = 8/46 (17%)

Query: 95  GSSSLSEEDRIGPSGLVGHRPHRCSVPGCSKEFK----LKAHLHRH 136
           GSS  S             R H CS PGC K +     LKAH   H
Sbjct: 1   GSSGSSGPQIDSS----RIRSHICSHPGCGKTYFKSSHLKAHTRTH 42


>1va1_A Transcription factor SP1; C2H2 type zinc finger, DNA-binding protein; NMR {Homo sapiens} Length = 37 Back     alignment and structure

Structure Templates Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query208
1gnf_A46 Transcription factor GATA-1; zinc finger, transcri 96.92
2vut_I43 AREA, nitrogen regulatory protein AREA; transcript 96.42
3dfx_A63 Trans-acting T-cell-specific transcription factor 96.08
4gat_A66 Nitrogen regulatory protein AREA; DNA binding prot 95.79
2kae_A71 GATA-type transcription factor; zinc finger, GATA- 95.73
4hc9_A115 Trans-acting T-cell-specific transcription factor; 94.7
4hc9_A115 Trans-acting T-cell-specific transcription factor; 92.86
2eln_A38 Zinc finger protein 406; ZFAT zinc finger 1, struc 91.97
1va1_A37 Transcription factor SP1; C2H2 type zinc finger, D 89.27
1zfd_A32 SWI5; DNA binding motif, zinc finger DNA binding d 87.11
1sp2_A31 SP1F2; zinc finger, transcription activation; NMR 86.87
1bhi_A38 CRE-BP1, ATF-2; CRE binding protein, transcription 86.49
1x3c_A73 Zinc finger protein 292; DNA binding, nuclear prot 84.65
1fv5_A36 First zinc finger of U-shaped; CCHC, protein inter 84.22
>1gnf_A Transcription factor GATA-1; zinc finger, transcription regulation; NMR {Mus musculus} SCOP: g.39.1.1 PDB: 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
Probab=96.92  E-value=0.00042  Score=46.40  Aligned_cols=37  Identities=19%  Similarity=0.535  Sum_probs=33.1

Q ss_pred             CCCCCCCccccCcccceecCCC-cccccccchhhhccCC
Q psy2138          36 SNKPCESCNVTVSTNWFPWYSG-TKICINCHAFWRKYGG   73 (208)
Q Consensus        36 ~g~~CEsC~t~~S~QWYsWGP~-cRLC~sCW~YWKKYGG   73 (208)
                      .++.|..|.++++.+|=. ||. .-||..|-+|||++|-
T Consensus         3 ~~~~C~~C~tt~Tp~WR~-gp~G~~LCNaCGl~~k~~~~   40 (46)
T 1gnf_A            3 EARECVNCGATATPLWRR-DRTGHYLCNACGLYHKMNGQ   40 (46)
T ss_dssp             CSCCCTTTCCCCCSSCBC-CTTCCCBCSHHHHHHHHTCS
T ss_pred             CCCCCCCcCCCCCCcCcc-CCCCCccchHHHHHHHHcCC
Confidence            368899999999999987 777 7899999999999984



>2vut_I AREA, nitrogen regulatory protein AREA; transcription regulation, protein-protein interactions, metal-binding, nitrate assimilation; HET: NAD; 2.3A {Emericella nidulans} SCOP: g.39.1.1 PDB: 2vus_I* 2vuu_I* Back     alignment and structure
>3dfx_A Trans-acting T-cell-specific transcription factor GATA-3; activator, DNA-binding, metal-binding, nucleus; HET: DNA; 2.70A {Mus musculus} PDB: 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* Back     alignment and structure
>4gat_A Nitrogen regulatory protein AREA; DNA binding protein, transcription factor, zinc binding domain, complex (transcription regulation/DNA); HET: DNA; NMR {Emericella nidulans} SCOP: g.39.1.1 PDB: 5gat_A* 6gat_A* 7gat_A* Back     alignment and structure
>2kae_A GATA-type transcription factor; zinc finger, GATA-type, DNA; NMR {Caenorhabditis elegans} Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>4hc9_A Trans-acting T-cell-specific transcription factor; zinc finger, GATA transcription factor, DNA bridging, transc DNA complex; HET: DNA; 1.60A {Homo sapiens} PDB: 4hc7_A* 4hca_A* 3dfx_A* 3dfv_D* 2gat_A* 3gat_A* 1gat_A* 1gau_A* 1gnf_A 1y0j_A 2l6y_A 2l6z_A Back     alignment and structure
>2eln_A Zinc finger protein 406; ZFAT zinc finger 1, structural genomics, NPPSFA, national project on protein structural and functional analyses; NMR {Homo sapiens} Back     alignment and structure
>1va1_A Transcription factor SP1; C2H2 type zinc finger, DNA-binding protein; NMR {Homo sapiens} Back     alignment and structure
>1zfd_A SWI5; DNA binding motif, zinc finger DNA binding domain; NMR {Saccharomyces cerevisiae} SCOP: g.37.1.1 Back     alignment and structure
>1sp2_A SP1F2; zinc finger, transcription activation; NMR {Homo sapiens} SCOP: g.37.1.1 PDB: 1va2_A Back     alignment and structure
>1bhi_A CRE-BP1, ATF-2; CRE binding protein, transcriptional activation domain, Zn finger, DNA-binding regulatory protein; NMR {Homo sapiens} SCOP: g.37.1.1 Back     alignment and structure
>1x3c_A Zinc finger protein 292; DNA binding, nuclear protein, C2H2-type zinc finger, KIAA0530, structural genomics, NPPSFA; NMR {Homo sapiens} SCOP: g.37.1.1 Back     alignment and structure
>1fv5_A First zinc finger of U-shaped; CCHC, protein interaction, transcription; NMR {Drosophila melanogaster} SCOP: g.37.1.2 PDB: 1y0j_B 2l6z_B Back     alignment and structure

Homologous Structure Domains

Structure Domains Detected by RPS-BLAST ?

ID ?Alignment Graph ?Length ? Definition ? E-value ?
Query 208
d1ubdc428 g.37.1.1 (C:381-408) Ying-yang 1 (yy1, zinc finger 5e-04
d1zfda_32 g.37.1.1 (A:) SWI5 zinc-finger domains {Baker's ye 6e-04
d2glia330 g.37.1.1 (A:168-197) Five-finger GLI1 {Human (Homo 7e-04
d1sp2a_31 g.37.1.1 (A:) Transcription factor sp1 {Human (Hom 0.001
d1a1ia129 g.37.1.1 (A:103-131) ZIF268 {Mouse (Mus musculus) 0.001
d1ubdc330 g.37.1.1 (C:351-380) Ying-yang 1 (yy1, zinc finger 0.002
>d1ubdc4 g.37.1.1 (C:381-408) Ying-yang 1 (yy1, zinc finger domain) {Human (Homo sapiens) [TaxId: 9606]} Length = 28 Back     information, alignment and structure

class: Small proteins
fold: beta-beta-alpha zinc fingers
superfamily: beta-beta-alpha zinc fingers
family: Classic zinc finger, C2H2
domain: Ying-yang 1 (yy1, zinc finger domain)
species: Human (Homo sapiens) [TaxId: 9606]
 Score = 34.2 bits (79), Expect = 5e-04
 Identities = 10/26 (38%), Positives = 15/26 (57%)

Query: 114 RPHRCSVPGCSKEFKLKAHLHRHLAT 139
           RP+ C   GC+K+F    +L  H+ T
Sbjct: 1   RPYVCPFDGCNKKFAQSTNLKSHILT 26


>d1zfda_ g.37.1.1 (A:) SWI5 zinc-finger domains {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Length = 32 Back     information, alignment and structure
>d2glia3 g.37.1.1 (A:168-197) Five-finger GLI1 {Human (Homo sapiens) [TaxId: 9606]} Length = 30 Back     information, alignment and structure
>d1sp2a_ g.37.1.1 (A:) Transcription factor sp1 {Human (Homo sapiens) [TaxId: 9606]} Length = 31 Back     information, alignment and structure
>d1a1ia1 g.37.1.1 (A:103-131) ZIF268 {Mouse (Mus musculus) [TaxId: 10090]} Length = 29 Back     information, alignment and structure
>d1ubdc3 g.37.1.1 (C:351-380) Ying-yang 1 (yy1, zinc finger domain) {Human (Homo sapiens) [TaxId: 9606]} Length = 30 Back     information, alignment and structure

Homologous Domains Detected by HHsearch ?

ID ?Alignment Graph ?Length ? Definition ? Probability ?
Query208
d1y0ja139 Erythroid transcription factor GATA-1 {Mouse (Mus 96.89
d2vuti142 Erythroid transcription factor GATA-1 {Emericella 96.1
d2glia330 Five-finger GLI1 {Human (Homo sapiens) [TaxId: 960 95.87
d1ubdc428 Ying-yang 1 (yy1, zinc finger domain) {Human (Homo 95.81
d1zfda_32 SWI5 zinc-finger domains {Baker's yeast (Saccharom 95.74
d1ubdc330 Ying-yang 1 (yy1, zinc finger domain) {Human (Homo 95.38
d1a1ia129 ZIF268 {Mouse (Mus musculus) [TaxId: 10090]} 95.26
d3gata_66 Erythroid transcription factor GATA-1 {Chicken (Ga 95.16
d1sp2a_31 Transcription factor sp1 {Human (Homo sapiens) [Ta 94.76
d1srka_35 Zinc finger protein ZFPM1 (FOG-1) {Mouse (Mus musc 91.08
d2adra129 ADR1 {Synthetic, based on Saccharomyces cerevisiae 90.66
d2ct1a236 Transcriptional repressor CTCF {Human (Homo sapien 90.26
d1ncsa_47 SWI5 zinc-finger domains {Baker's yeast (Saccharom 90.18
d1tf3a230 Transcription factor IIIA, TFIIIA {Xenopus laevis 89.4
d1a1ia228 ZIF268 {Mouse (Mus musculus) [TaxId: 10090]} 88.49
d2dlka236 Zinc finger protein 692, ZNF692 {Human (Homo sapie 86.06
d1p7aa_37 Kruppel-like factor 3, Bklf {Mouse (Mus musculus) 85.5
d2epsa139 PATZ1 {Human (Homo sapiens) [TaxId: 9606]} 85.01
d1x6ha236 Transcriptional repressor CTCF {Human (Homo sapien 83.59
d1x6ea226 Zinc finger protein 24 {Human (Homo sapiens) [TaxI 83.32
d2cota238 Zinc finger and SCAN domain-containing protein 16, 82.16
d1x6ea133 Zinc finger protein 24 {Human (Homo sapiens) [TaxI 81.97
d2j7ja229 Transcription factor IIIA, TFIIIA {Xenopus laevis 80.06
>d1y0ja1 g.39.1.1 (A:200-238) Erythroid transcription factor GATA-1 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
class: Small proteins
fold: Glucocorticoid receptor-like (DNA-binding domain)
superfamily: Glucocorticoid receptor-like (DNA-binding domain)
family: Erythroid transcription factor GATA-1
domain: Erythroid transcription factor GATA-1
species: Mouse (Mus musculus) [TaxId: 10090]
Probab=96.89  E-value=0.00021  Score=45.04  Aligned_cols=35  Identities=20%  Similarity=0.573  Sum_probs=32.3

Q ss_pred             CCCCCCccccCcccceecCCC-cccccccchhhhccC
Q psy2138          37 NKPCESCNVTVSTNWFPWYSG-TKICINCHAFWRKYG   72 (208)
Q Consensus        37 g~~CEsC~t~~S~QWYsWGP~-cRLC~sCW~YWKKYG   72 (208)
                      ++-|.+|.+++.++|=. ||. ..||-.|=+|||++|
T Consensus         2 ~r~C~~Cgtt~Tp~WR~-gp~G~~LCNACGl~~r~~G   37 (39)
T d1y0ja1           2 ARECVNCGATATPLWRR-DRTGHYLCNACGLYHKMNG   37 (39)
T ss_dssp             CCCCSSSCCCCCSCCEE-CTTSCEECSSHHHHHHHSC
T ss_pred             cCCCCCCCCCCCccccc-CCCCCEeeHHhHHHHHHhC
Confidence            57899999999999985 888 789999999999998



>d2vuti1 g.39.1.1 (I:671-712) Erythroid transcription factor GATA-1 {Emericella nidulans [TaxId: 162425]} Back     information, alignment and structure
>d2glia3 g.37.1.1 (A:168-197) Five-finger GLI1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1ubdc4 g.37.1.1 (C:381-408) Ying-yang 1 (yy1, zinc finger domain) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1zfda_ g.37.1.1 (A:) SWI5 zinc-finger domains {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1ubdc3 g.37.1.1 (C:351-380) Ying-yang 1 (yy1, zinc finger domain) {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1a1ia1 g.37.1.1 (A:103-131) ZIF268 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d3gata_ g.39.1.1 (A:) Erythroid transcription factor GATA-1 {Chicken (Gallus gallus) [TaxId: 9031]} Back     information, alignment and structure
>d1sp2a_ g.37.1.1 (A:) Transcription factor sp1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1srka_ g.37.1.1 (A:) Zinc finger protein ZFPM1 (FOG-1) {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2adra1 g.37.1.1 (A:102-130) ADR1 {Synthetic, based on Saccharomyces cerevisiae sequence} Back     information, alignment and structure
>d2ct1a2 g.37.1.1 (A:8-43) Transcriptional repressor CTCF {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1ncsa_ g.37.1.1 (A:) SWI5 zinc-finger domains {Baker's yeast (Saccharomyces cerevisiae) [TaxId: 4932]} Back     information, alignment and structure
>d1tf3a2 g.37.1.1 (A:41-70) Transcription factor IIIA, TFIIIA {Xenopus laevis [TaxId: 8355]} Back     information, alignment and structure
>d1a1ia2 g.37.1.1 (A:132-159) ZIF268 {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2dlka2 g.37.1.1 (A:38-73) Zinc finger protein 692, ZNF692 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1p7aa_ g.37.1.1 (A:) Kruppel-like factor 3, Bklf {Mouse (Mus musculus) [TaxId: 10090]} Back     information, alignment and structure
>d2epsa1 g.37.1.1 (A:408-446) PATZ1 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x6ha2 g.37.1.1 (A:8-43) Transcriptional repressor CTCF {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x6ea2 g.37.1.1 (A:41-66) Zinc finger protein 24 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2cota2 g.37.1.1 (A:7-44) Zinc finger and SCAN domain-containing protein 16, ZSCAN16 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d1x6ea1 g.37.1.1 (A:8-40) Zinc finger protein 24 {Human (Homo sapiens) [TaxId: 9606]} Back     information, alignment and structure
>d2j7ja2 g.37.1.1 (A:29-57) Transcription factor IIIA, TFIIIA {Xenopus laevis [TaxId: 8355]} Back     information, alignment and structure