BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>006832
MPGAGKKVVKHSESGKSCFPKKDSGSELIASLKFKKGSKISKSKKLRSKSRCQSKTVNST
LLKRTVAESHSKGAGDDFARSKSISQKNLHIKIDRKGSKNWASSKHKGKNSALVISKGNG
EVVDGDGETKKLRKGRSKKRRKEKVELDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWK
GHSREKIRPEKELQRAKKQILKCKIGIRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHII
CAKCKLREAFPDNDIVLCDGTCNCAFHQKCLDPPLDTESRDQGWFCKFCECKMEIIESMN
AHIGTSFSVNSNWQDIFKEEAAFPDGCSALLNQEEEWPSDDSEDDDYNPERRENSCSISR
AGTDDDPSSSTSLSWFSDSETFSESMRWEMESNGYKNYSVDSSIGSDETSDGEIICGRRQ
RRTVDYKKLYDEMFGKDASAFEQLSEDEDWGPAKRRRKEKESDAVNSLMTLYGSEEKYSK
VKTAEVKKKLPSNAKIRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKV
NKWFKNARYLALKARKVESARQVSGSPRISKESSLETEKKNADVLTLKNSLEETLICSPK
SLKKIHPKRIQNQSAVAAASRKISRKELL

High Scoring Gene Products

Symbol   Full name                                    Information                         P value
PRHA     pathogenesis related homeodomain protein A   protein from Arabidopsis thaliana   1.0e-123
HAT3.1                                                protein from Arabidopsis thaliana   2.1e-56
I3LTW3   Uncharacterized protein                      protein from Sus scrofa             8.8e-05
ceh-54                                                gene from Caenorhabditis elegans    0.00037
cec-8                                                 gene from Caenorhabditis elegans    0.00086

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  006832
        (629 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2005538 - symbol:PRHA "pathogenesis related ho...  1216  1.0e-123  1
TAIR|locus:2090694 - symbol:HAT3.1 species:3702 "Arabidop...   478  2.1e-56   2
UNIPROTKB|I3LTW3 - symbol:I3LTW3 "Uncharacterized protein...   104  8.8e-05   1
WB|WBGene00020485 - symbol:ceh-54 species:6239 "Caenorhab...   115  0.00037   1
WB|WBGene00021913 - symbol:cec-8 species:6239 "Caenorhabd...   120  0.00086   1
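The one-line hit summaries above can be read programmatically. The following is a minimal Python sketch, assuming each line has the layout "identifier - truncated description  score  P(N)  N" as seen in this report; the names HIT_LINE and parse_hit_line are hypothetical and not part of any BLAST tool.

```python
import re

# Assumed layout (inferred from this report only):
#   <identifier> - <truncated description>  <score>  <P(N)>  <N>
HIT_LINE = re.compile(
    r"^(?P<id>\S+) - (?P<desc>.*?)\s+(?P<score>\d+)\s+(?P<p>\S+)\s+(?P<n>\d+)$"
)

def parse_hit_line(line):
    """Return a dict for one summary line, or None if the line does not match."""
    m = HIT_LINE.match(line.rstrip())
    if m is None:
        return None
    return {
        "id": m["id"],
        "score": int(m["score"]),
        "p": float(m["p"]),
        "n": int(m["n"]),
    }

# The five summary lines, copied from the report.
lines = [
    'TAIR|locus:2005538 - symbol:PRHA "pathogenesis related ho...  1216  1.0e-123  1',
    'TAIR|locus:2090694 - symbol:HAT3.1 species:3702 "Arabidop...   478  2.1e-56   2',
    'UNIPROTKB|I3LTW3 - symbol:I3LTW3 "Uncharacterized protein...   104  8.8e-05   1',
    'WB|WBGene00020485 - symbol:ceh-54 species:6239 "Caenorhab...   115  0.00037   1',
    'WB|WBGene00021913 - symbol:cec-8 species:6239 "Caenorhabd...   120  0.00086   1',
]
hits = [parse_hit_line(line) for line in lines]
```

Each entry in hits then carries the database identifier, the high score, the smallest sum probability P(N), and the number of segment pairs N.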


>TAIR|locus:2005538
            symbol:PRHA "pathogenesis related homeodomain protein  A"
            species:3702 "Arabidopsis thaliana" [GO:0003700 "sequence-specific
            DNA binding transcription factor activity" evidence=IEA;ISS]
            [GO:0005634 "nucleus" evidence=ISM;ISS] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA;ISS] [GO:0008270 "zinc
            ion binding" evidence=IEA] [GO:0043565 "sequence-specific DNA
            binding" evidence=IEA;IDA] [GO:0045893 "positive regulation of
            transcription, DNA-dependent" evidence=IDA] [GO:0009733 "response
            to auxin stimulus" evidence=IEP] [GO:0005515 "protein binding"
            evidence=IPI] InterPro:IPR001356 InterPro:IPR001965
            InterPro:IPR009057 InterPro:IPR019787 Pfam:PF00046 Pfam:PF00628
            PROSITE:PS50016 PROSITE:PS50071 SMART:SM00249 SMART:SM00389
            GO:GO:0005634 GO:GO:0045893 EMBL:CP002687 GenomeReviews:CT486007_GR
            GO:GO:0006952 GO:GO:0009733 GO:GO:0046872 GO:GO:0043565
            GO:GO:0008270 GO:GO:0003700 GO:GO:0006351 Gene3D:1.10.10.60
            SUPFAM:SSF46689 EMBL:AL050352 EMBL:AL161575 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903
            InterPro:IPR019786 PROSITE:PS01359 EMBL:L21991 EMBL:U48864
            IPI:IPI00540289 PIR:T08555 RefSeq:NP_194723.1 UniGene:At.19983
            ProteinModelPortal:P48785 SMR:P48785 IntAct:P48785 STRING:P48785
            PaxDb:P48785 PRIDE:P48785 EnsemblPlants:AT4G29940.1 GeneID:829117
            KEGG:ath:AT4G29940 TAIR:At4g29940 eggNOG:NOG311547
            HOGENOM:HOG000115692 InParanoid:P48785 OMA:MTEESHE PhylomeDB:P48785
            ProtClustDB:CLSN2716723 Genevestigator:P48785 GermOnline:AT4G29940
            Uniprot:P48785
        Length = 796

 Score = 1216 (433.1 bits), Expect = 1.0e-123, P = 1.0e-123
 Identities = 245/460 (53%), Positives = 317/460 (68%)

Query:   145 VELDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRPEKELQRAKKQILKCK 204
             VE+D++ RLQRRTRYLLIKMK++QNLIDAY+ EGWKG SREKIRP+KEL+RA+K+IL CK
Sbjct:    97 VEVDDSLRLQRRTRYLLIKMKMQQNLIDAYATEGWKGQSREKIRPDKELERARKEILNCK 156

Query:   205 IGIRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNC 264
             +G+RDAIRQLD LSSVG +E  VIA+DGS+HH+HI CA+C  REAFPDNDI+LCDGTCN 
Sbjct:   157 LGLRDAIRQLDLLSSVGSMEEKVIASDGSIHHDHIFCAECNSREAFPDNDIILCDGTCNR 216

Query:   265 AFHQKCLDPPLDTES---RDQGWFCKFCECKMEIIESMNAHIGTSFSVNSNWQDIFKEEA 321
             AFHQKCLDPPL+TES    DQGWFCKFC+CK+EII++MNA IGT F V+SNWQDIF EEA
Sbjct:   217 AFHQKCLDPPLETESIPPGDQGWFCKFCDCKIEIIDTMNAQIGTHFPVDSNWQDIFNEEA 276

Query:   322 AFPDGCSALLNQXXXXXXXXXXXXXXXXXRRENSCSISRAGTXXXXXXXXXXXXXXXXET 381
             + P G  A +N                   REN    S   +                  
Sbjct:   277 SLPIGSEATVNNEADWPSDDSKDDDYDPEMRENGGGNSSNVSGDGGGDNDEESISTSLSL 336

Query:   382 FSESMRWEMESNGYKNYSVDSSIGSDETSDGEIICGRRQRRTVDYKKLYDEMFGKDASAF 441
              S+ +   + +  ++ + + + +   ETS+ E +CG RQRRTVDY +LY EMFGKDA   
Sbjct:   337 SSDGVA--LSTGSWEGHRLSNMVEQCETSNEETVCGPRQRRTVDYTQLYYEMFGKDAVLQ 394

Query:   442 EQLSEDEDWGPAKRRRKEKESDAVNSLMTLYGSEEK-YSKVKTAEVKKK---LPSNAKIR 497
             EQ SEDEDWGP  RR++++ESDA ++L+T+  S +K    V+T E  ++      N   R
Sbjct:   395 EQGSEDEDWGPNDRRKRKRESDAGSTLVTMCESSKKDQDVVETLEQSERDSVSVENKGGR 454

Query:   498 RSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNARYLALKARKV 557
             R   R+P NAVEKLRQVFAE ELPS+ V++ L+KELSL+PEKVNKWFKN RY+AL+ RK 
Sbjct:   455 RRMFRLPRNAVEKLRQVFAETELPSKAVRDRLAKELSLDPEKVNKWFKNTRYMALRNRKT 514

Query:   558 ESARQVSGSPRISK-ESSLETE-KKNADVLTLKNSLEETL 595
             ES +Q   S  +S  +S  E   + N +   ++++L++T+
Sbjct:   515 ESVKQPGDSKTVSGGDSGPEAVMENNTETNEVQDTLDDTV 554
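
The percentages on the Identities and Positives lines in this report appear to be truncated rather than rounded: 317/460 is about 68.9% but is shown as 68%. The following is a minimal Python sketch of that arithmetic, assuming truncation (an inference from the figures in this report, not documented behaviour); the function name percent is hypothetical.

```python
def percent(matches, length):
    """Whole-number percentage, truncated toward zero (assumed, see lead-in)."""
    return (100 * matches) // length

identities = percent(245, 460)  # first alignment, Identities = 245/460
positives = percent(317, 460)   # first alignment, Positives = 317/460
```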


>TAIR|locus:2090694
            symbol:HAT3.1 species:3702 "Arabidopsis thaliana"
            [GO:0003700 "sequence-specific DNA binding transcription factor
            activity" evidence=IEA;ISS] [GO:0005634 "nucleus" evidence=ISM;ISS]
            [GO:0006355 "regulation of transcription, DNA-dependent"
            evidence=IEA;ISS;RCA] [GO:0008270 "zinc ion binding" evidence=IEA]
            [GO:0043565 "sequence-specific DNA binding" evidence=IEA;IDA]
            [GO:0045893 "positive regulation of transcription, DNA-dependent"
            evidence=RCA;IDA] [GO:0008284 "positive regulation of cell
            proliferation" evidence=RCA] [GO:0009560 "embryo sac egg cell
            differentiation" evidence=RCA] [GO:0043687 "post-translational
            protein modification" evidence=RCA] [GO:0051276 "chromosome
            organization" evidence=RCA] [GO:0003677 "DNA binding" evidence=IDA]
            InterPro:IPR001356 InterPro:IPR001965 InterPro:IPR009057
            InterPro:IPR019787 Pfam:PF00046 Pfam:PF00628 PROSITE:PS00027
            PROSITE:PS50016 PROSITE:PS50071 SMART:SM00249 SMART:SM00389
            GO:GO:0005634 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0045893
            GO:GO:0046872 GO:GO:0043565 GO:GO:0008270 GO:GO:0003700
            GO:GO:0006351 Gene3D:1.10.10.60 SUPFAM:SSF46689 Gene3D:3.30.40.10
            InterPro:IPR011011 InterPro:IPR013083 SUPFAM:SSF57903
            InterPro:IPR019786 PROSITE:PS01359 EMBL:AB025624 EMBL:X69512
            EMBL:AK117105 EMBL:BT005965 IPI:IPI00537420 PIR:S31437
            RefSeq:NP_188582.1 UniGene:At.19694 ProteinModelPortal:Q04996
            SMR:Q04996 IntAct:Q04996 STRING:Q04996 PaxDb:Q04996 PRIDE:Q04996
            EnsemblPlants:AT3G19510.1 GeneID:821485 KEGG:ath:AT3G19510
            TAIR:At3g19510 eggNOG:NOG79337 HOGENOM:HOG000112850
            InParanoid:Q04996 OMA:KEGECET PhylomeDB:Q04996
            ProtClustDB:CLSN2915378 Genevestigator:Q04996 Uniprot:Q04996
        Length = 723

 Score = 478 (173.3 bits), Expect = 2.1e-56, Sum P(2) = 2.1e-56
 Identities = 90/180 (50%), Positives = 123/180 (68%)

Query:   146 ELDEASRLQRRTRYLLIKMKLEQNLIDAYSGEGWKGHSREKIRPEKELQRAKKQILKCKI 205
             E DE +R++++ RY L ++  EQ+LIDAYS EGWKG S EKIRPEKEL+RA K+IL+ K+
Sbjct:   173 EDDEYTRIKKKLRYFLNRINYEQSLIDAYSLEGWKGSSLEKIRPEKELERATKEILRRKL 232

Query:   206 GIRDAIRQLDSLSSVGCIEGSVIATDGSVHHEHIICAKCKLREAFPDNDIVLCDGTCNCA 265
              IRD  + LD+L + G +  S+  TDG +  E I CAKC  ++   DNDI+LCDG C+  
Sbjct:   233 KIRDLFQHLDTLCAEGSLPESLFDTDGEISSEDIFCAKCGSKDLSVDNDIILCDGFCDRG 292

Query:   266 FHQKCLDPPL---DTESRDQGWFCKFCECKMEIIESMNAHIGTSFSVNSNWQDIFKEEAA 322
             FHQ CL+PPL   D    D+GW C  C+CK + ++ +N  +GT FSV+ +W+ IF E AA
Sbjct:   293 FHQYCLEPPLRKEDIPPDDEGWLCPGCDCKDDSLDLLNDSLGTKFSVSDSWEKIFPEAAA 352

 Score = 127 (49.8 bits), Expect = 2.1e-56, Sum P(2) = 2.1e-56
 Identities = 43/153 (28%), Positives = 71/153 (46%)

Query:   437 DASAFEQLSEDEDWGPAKRRRKEKESDAVNSL-MTLYGSEEKYSKVKTAEVKKKLPSNAK 495
             D    +Q S  ED    K  RK K +D  ++L M   G  E      + E++K   S  K
Sbjct:   564 DTVPLKQSSNAEDHTSKKLIRKSKRADKKDTLEMPQEGPGENGG---SGEIEKSSSSACK 620

Query:   496 IRRSFHRMPPNAVEKLRQVFAENELPSRIVKENLSKELSLEPEKVNKWFKNARYLALKAR 555
                   +  P   ++L   F EN+ P +  KE+L+KEL +  ++VN WFK+ R+ ++ ++
Sbjct:   621 ------QTDPKT-QRLYISFQENQYPDKATKESLAKELQMTVKQVNNWFKHRRW-SINSK 672

Query:   556 KVESARQVSGSPRISKESSLETEKKNADVLTLK 588
              + S   V    +  KE   ET    +   T++
Sbjct:   673 PLVSEENVE-KLKTGKEGECETSVAGSSKQTME 704

 Score = 94 (38.1 bits), Expect = 6.3e-53, Sum P(2) = 6.3e-53
 Identities = 45/167 (26%), Positives = 74/167 (44%)

Query:   400 VDSSIGSDETSDGEIICGRRQRRTVDYKKLYDEMFGKDASAFEQLSEDEDWGPAKRRRKE 459
             ++S +G D+   G  +  RR    +DYKKLYDE +    ++    S+D+DW    R  KE
Sbjct:   501 LESDVGLDDGPAG--VSRRRNVERLDYKKLYDEEYDNVPTSS---SDDDDWDKTARMGKE 555

Query:   460 -KESDAVNSLMTLYGSEEKYSKVKTAEVKKKLPSNAKIRRSFHRMPPNAVEKLRQVFAEN 518
               ES+     + L     K S        KKL     IR+S      + +E  ++   EN
Sbjct:   556 DSESEDEGDTVPL-----KQSSNAEDHTSKKL-----IRKSKRADKKDTLEMPQEGPGEN 605

Query:   519 ELPSRIVKENLS--KELSLEPEKVNKWFKNARYLALKARKVESARQV 563
                  I K + S  K+   + +++   F+  +Y   KA K   A+++
Sbjct:   606 GGSGEIEKSSSSACKQTDPKTQRLYISFQENQYPD-KATKESLAKEL 651


>UNIPROTKB|I3LTW3
            symbol:I3LTW3 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0008270 "zinc ion binding" evidence=IEA]
            InterPro:IPR001965 InterPro:IPR019787 Pfam:PF00628 SMART:SM00249
            GO:GO:0046872 GO:GO:0008270 Gene3D:3.30.40.10 InterPro:IPR011011
            InterPro:IPR013083 SUPFAM:SSF57903 GeneTree:ENSGT00390000008296
            EMBL:AEMK01185240 EMBL:CU928954 Ensembl:ENSSSCT00000025343
            Uniprot:I3LTW3
        Length = 113

 Score = 104 (41.7 bits), Expect = 8.8e-05, P = 8.8e-05
 Identities = 18/44 (40%), Positives = 28/44 (63%)

Query:   256 VLCDGTCNCAFHQKCLDPPLDTESRDQGWFCKFCEC-KMEIIES 298
             VLCD  CN A+H  CL+PPLD    ++ W+C  C+    E++++
Sbjct:     3 VLCD-ECNMAYHIYCLNPPLDKVPEEEYWYCPSCKTDSSEVVKA 45


>WB|WBGene00020485
            symbol:ceh-54 species:6239 "Caenorhabditis elegans"
            [GO:0003700 "sequence-specific DNA binding transcription factor
            activity" evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0043565 "sequence-specific DNA
            binding" evidence=IEA] [GO:0005634 "nucleus" evidence=IEA]
            InterPro:IPR001356 InterPro:IPR009057 InterPro:IPR017970
            Pfam:PF00046 PROSITE:PS00027 PROSITE:PS50071 SMART:SM00389
            GO:GO:0005634 GO:GO:0043565 GO:GO:0003700 Gene3D:1.10.10.60
            SUPFAM:SSF46689 EMBL:FO080546 HSSP:P06601
            GeneTree:ENSGT00700000104256 PIR:T16866 RefSeq:NP_509037.1
            UniGene:Cel.367 ProteinModelPortal:Q22459 SMR:Q22459 DIP:DIP-24677N
            IntAct:Q22459 MINT:MINT-1110113 EnsemblMetazoa:T13C5.4
            GeneID:188474 KEGG:cel:CELE_T13C5.4 UCSC:T13C5.4 CTD:188474
            WormBase:T13C5.4 eggNOG:NOG295926 HOGENOM:HOG000020434
            InParanoid:Q22459 OMA:ATRDSEN NextBio:938936 Uniprot:Q22459
        Length = 220

 Score = 115 (45.5 bits), Expect = 0.00037, P = 0.00037
 Identities = 32/111 (28%), Positives = 58/111 (52%)

Query:   483 TAEVKKKLPSNAKI-RRSFHRMP--PNAVEKLRQVFAENELPSRIVKENLSKELSLEPEK 539
             T ++ +K  +N    R+  +R+    N ++++ +VFAEN+ P  + +E L+ ++ L  E+
Sbjct:    28 TKKITRKRSNNFNPDRKKRNRITFDANQIDEMEKVFAENQYPDTMSREKLANKIQLHEER 87

Query:   540 VNKWFKNARYLALKARKVESARQVSGSPRISKESSLETEKKNADVLTLKNS 590
             V  WF+N R  A   R+ +        P I+K  + E EK   D  TL ++
Sbjct:    88 VQIWFQNRR--AKYRREQKQTGHPYEPPSITKNPTGEKEKTQ-DCTTLTSA 135


>WB|WBGene00021913
            symbol:cec-8 species:6239 "Caenorhabditis elegans"
            [GO:0005634 "nucleus" evidence=IEA] [GO:0009792 "embryo development
            ending in birth or egg hatching" evidence=IMP] InterPro:IPR000953
            PROSITE:PS50013 SMART:SM00298 Pfam:PF00385 GO:GO:0005634
            GO:GO:0009792 InterPro:IPR016197 SUPFAM:SSF54160 InterPro:IPR023780
            GeneTree:ENSGT00690000102102 EMBL:FO081109 RefSeq:NP_497198.1
            ProteinModelPortal:Q95XW8 SMR:Q95XW8 DIP:DIP-25228N IntAct:Q95XW8
            MINT:MINT-1109877 STRING:Q95XW8 PaxDb:Q95XW8
            EnsemblMetazoa:Y55B1BR.3 GeneID:175202 KEGG:cel:CELE_Y55B1BR.3
            UCSC:Y55B1BR.3 CTD:175202 WormBase:Y55B1BR.3 eggNOG:NOG255798
            InParanoid:Q95XW8 OMA:PRANNEN NextBio:887184 Uniprot:Q95XW8
        Length = 679

 Score = 120 (47.3 bits), Expect = 0.00086, P = 0.00086
 Identities = 58/232 (25%), Positives = 104/232 (44%)

Query:   407 DETSDGEIICGRRQRRTVDYKKL--YDEMFGKDASAFEQLSEDEDWGPAKRRRKEKESDA 464
             D++SD +    RR++++   KK   + +   +  +  +  S+DED      +R +K   A
Sbjct:   245 DDSSDDDSEMERRRKKSKKSKKSKKFKKSEKRKRAVNDSSSDDEDEEEKPEKRSKKSKKA 304

Query:   465 VNSLMTLYGSEEKYSKVKTAEVKKKLPSNAKIRRSFHRMPPNAVEKLRQVFAENELPSRI 524
             V    +    EEK SK ++ + KK+     +   S   +    VE  +   +  + P + 
Sbjct:   305 VIDSSSEDEEEEKSSKKRSKKSKKESDEEQQASDSEEEV----VEVKKNSKSPKKTPKKT 360

Query:   525 -VKENLSKELSLEPEKVNKWFKNARYLALKARKVES--ARQVSGSPRISKESSLETEKKN 581
              VKE   +    E E+V K  K+++    KA++  S    +V  SP+   +S  ++ KK+
Sbjct:   361 AVKEESEESSGDEEEEVVKKKKSSKINKRKAKESSSDEEEEVEESPKKKTKSPRKSSKKS 420

Query:   582 A-----DVLTLKNSLEETLICSPKSLKKIH-PKRIQNQSAVAAASRKISRKE 627
             A     +  +  N  EE +  SPK  KK+  PK+   + A    S + S  E
Sbjct:   421 AAKEESEEESSDNEEEEEVDYSPK--KKVKSPKKSSKKPAAKVESEEPSDNE 470


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.312   0.128   0.366    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      629       556   0.00098  119 3  11 23  0.46    34
                                                     35  0.45    37
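
The bit scores printed next to each raw score are consistent with the standard normalisation bits = (Lambda * S - ln K) / ln 2, using the Lambda and K values from the Q=9,R=2 row above (0.244 and 0.0300). The following is a minimal Python sketch under that assumption; which row the program actually uses is inferred from the numbers agreeing, not stated in the report, and the names below are hypothetical.

```python
import math

# Assumed: gapped parameters from the "Q=9,R=2" row of the parameter table.
LAMBDA_GAPPED = 0.244
K_GAPPED = 0.0300

def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score S to bits: (lam * S - ln k) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)

# Raw scores taken from the report's "Score = ..." lines.
bits = {s: round(bit_score(s), 1) for s in (1216, 478, 127, 94, 104, 115, 120)}
```

Under these assumptions the computed values agree with the bit scores shown in the alignments, for example 433.1 bits for the raw score of 1216.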


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  5
  No. of states in DFA:  621 (66 KB)
  Total size of DFA:  315 KB (2160 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  53.73u 0.13s 53.86t   Elapsed:  00:00:02
  Total cpu time:  53.73u 0.13s 53.86t   Elapsed:  00:00:02
  Start:  Fri May 10 17:43:23 2013   End:  Fri May 10 17:43:25 2013
