BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>000335
MRMRMHGWRAGEEEGDRKREGRRRHMWTVPPRDSAILASVADGGSASPSSSSSLSSSPNS
VTFFSQDGRKISVGDCALFKPPQDSPPFIGIIRSLTAGKENKLTLSVNWLYRPAEVKLGK
GFLLEAAPNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGIGSFVCRKVYDITNKCL
WWLTDQDYINERQEEVDQLLYKTHIEMHATVPSGGRSPKPMNGPTSTSQLKPGSDGAQNS
ASFPSQVKGKKRERGDQSSEPVKRERSSKMEDGNSGHSRTETNLKTEIAKITEKGGLVDY
DGVEKLVQLMVPERNDKKIDLVCRSLLAGVVAATDKFDCLNWFVQLRGLLVFDEWLQEVH
KGKIGDAGSPRDGDKSVEEFLLILLRALDKLPVNLNALQMCNIGKSVNHLRTHKNVEIQK
KARSLVDTWKKRVEAEMDARPRLPEVPHSGNRQTGASTEVAIKSLVTQPASSKTGAVKLC
QGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPSTPAKDEKSSSSS
QSHNNGQSCVGDHAKTGGLSGKEDARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAGV
QRETVSSKNATLHRNSASDRPSQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSS
GGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAA
CPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIE
SCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRV
KSFPGDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQ
ENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRG
VETEAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVKGKDGELLHTS
GSGEDMPLKNVDEVKVEKADEVDSKSHVNQTEEQNSEWKSNAPMIREDRVVPHLGSAENE
EKGNGKVDHRENLEGKEVKEELCAGPALPEVSTALRAQETGQLVRTGAVKLTISEGDKAQ
ESTSTTIDAASSAVGVSDMEAKVEFDLNEGFDGDDGKYGESSNFIVPGCSGVVQQLVSPL
PLPVTSVSSSLPSSVTVAAAAKGPFVPPEDLLRSKVELGWKGSAATSAFRPAEPRKILEM
PLGATSISVPDSTSGKLGRPLLDIDLNVPDERVLEDLASRSSVQDTVTASDHTNNRDGSR
CEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSNGNKIDVPVQPGTSSGGLLNGEVNVR
RDFDLNDGPVLDDCSAEPSVFPQHPRNVSQAPVSGLRLSSADTVNFSSWFPRGNTYSTIA
VPSVLPDRGEQPFPIIAPCAPQRMLVPSTSGSPFGPDVFRGPVLSSSPAVPFPSAPFQYP
VFPFGTSFPLPSATFSGGTTTYVDSSSGGRFCFPAVNSQLMGPAGAVPSHFPRPYVVSLP
DGSNSASSESSWKRSRQSLDLNAGPGVPDIEGRDETSPLVPRQLSVAGSQVLTEDQARMY
QQMAGGHFKRKEPEGGWDGYKRPSWQ

High Scoring Gene Products

Symbol, full name Information P value
SUO
AT3G48050
protein from Arabidopsis thaliana 0.
AT3G48060 protein from Arabidopsis thaliana 0.
AT4G11560 protein from Arabidopsis thaliana 6.1e-19
AT4G24200 protein from Arabidopsis thaliana 6.1e-13
AT2G25120 protein from Arabidopsis thaliana 6.8e-12
AT4G23120 protein from Arabidopsis thaliana 3.4e-10
AT3G43990 protein from Arabidopsis thaliana 2.4e-08
DDB_G0268640
unknown
gene from Dictyostelium discoideum 1.3e-07
Muc68D
Mucin 68D
protein from Drosophila melanogaster 1.7e-07
DSPP
Dentin sialophosphoprotein
protein from Homo sapiens 1.8e-06
K06A9.1 gene from Caenorhabditis elegans 1.9e-05
AT2G22795 protein from Arabidopsis thaliana 3.3e-05
DDB_G0271670 gene from Dictyostelium discoideum 3.5e-05
Muc68Ca
Mucin 68Ca
protein from Drosophila melanogaster 5.3e-05
TCEA2
Transcription elongation factor A protein 2
protein from Homo sapiens 0.00014
Nefh
neurofilament, heavy polypeptide
protein from Mus musculus 0.00014
DDB_G0289357 gene from Dictyostelium discoideum 0.00018
DDB_G0295719
unknown
gene from Dictyostelium discoideum 0.00022
FLO11
GPI-anchored cell surface glycoprotein (flocculin)
gene from Saccharomyces cerevisiae 0.00022
DDB_G0293012
CHR group protein
gene from Dictyostelium discoideum 0.00035
TfIIS
RNA polymerase II elongation factor
protein from Drosophila melanogaster 0.00051

The BLAST search returned 2 gene products which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  000335
        (1646 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

TAIR|locus:2097915 - symbol:SUO "'shuttle' in chinese" sp...  3465  0.        1
TAIR|locus:2097925 - symbol:AT3G48060 species:3702 "Arabi...  3332  0.        1
TAIR|locus:2123136 - symbol:AT4G11560 species:3702 "Arabi...   264  6.1e-19   1
TAIR|locus:2135907 - symbol:AT4G24200 species:3702 "Arabi...   161  6.1e-13   4
TAIR|locus:2040209 - symbol:AT2G25120 species:3702 "Arabi...   195  6.8e-12   1
TAIR|locus:2121601 - symbol:AT4G23120 species:3702 "Arabi...   185  3.4e-10   2
TAIR|locus:2097134 - symbol:AT3G43990 species:3702 "Arabi...   163  2.4e-08   1
DICTYBASE|DDB_G0268640 - symbol:DDB_G0268640 "unknown" sp...   149  1.3e-07   2
FB|FBgn0036203 - symbol:Muc68D "Mucin 68D" species:7227 "...   187  1.7e-07   3
UNIPROTKB|Q9NZW4 - symbol:DSPP "Dentin sialophosphoprotei...   155  1.8e-06   2
WB|WBGene00019435 - symbol:K06A9.1 species:6239 "Caenorha...   136  1.9e-05   3
TAIR|locus:504955937 - symbol:AT2G22795 "AT2G22795" speci...   141  3.3e-05   2
DICTYBASE|DDB_G0271670 - symbol:DDB_G0271670 species:4468...   134  3.5e-05   1
FB|FBgn0036181 - symbol:Muc68Ca "Mucin 68Ca" species:7227...   131  5.3e-05   3
UNIPROTKB|Q5JRI8 - symbol:TCEA2 "Transcription elongation...   107  0.00014   1
MGI|MGI:97309 - symbol:Nefh "neurofilament, heavy polypep...   145  0.00014   2
DICTYBASE|DDB_G0289357 - symbol:DDB_G0289357 species:4468...   103  0.00018   2
DICTYBASE|DDB_G0295719 - symbol:DDB_G0295719 "unknown" sp...   130  0.00022   1
SGD|S000001458 - symbol:FLO11 "GPI-anchored cell surface ...   143  0.00022   1
DICTYBASE|DDB_G0293012 - symbol:DDB_G0293012 "CHR group p...    98  0.00035   3
UNIPROTKB|F1M074 - symbol:F1M074 "Uncharacterized protein...   130  0.00046   1
UNIPROTKB|E9PTW3 - symbol:Dspp "Dentin sialophosphoprotei...   129  0.00049   1
FB|FBgn0010422 - symbol:TfIIS "RNA polymerase II elongati...   122  0.00051   1
POMBASE|SPBC215.13 - symbol:SPBC215.13 "sequence orphan" ...   124  0.00078   1


>TAIR|locus:2097915 [details] [associations]
            symbol:SUO "'shuttle' in chinese" species:3702
            "Arabidopsis thaliana" [GO:0003677 "DNA binding" evidence=IEA;ISS]
            [GO:0005634 "nucleus" evidence=IEA;IDA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=IEA] [GO:0016021 "integral
            to membrane" evidence=IEA] [GO:0000932 "cytoplasmic mRNA processing
            body" evidence=IDA] [GO:0031053 "primary miRNA processing"
            evidence=IMP] [GO:0035278 "negative regulation of translation
            involved in gene silencing by miRNA" evidence=IDA]
            InterPro:IPR001025 InterPro:IPR003617 InterPro:IPR017923
            Pfam:PF01426 Pfam:PF08711 PROSITE:PS51038 PROSITE:PS51319
            SMART:SM00439 SMART:SM00509 GO:GO:0005634 EMBL:CP002686
            GO:GO:0003677 GO:GO:0006351 GO:GO:0000932 GO:GO:0035278
            Gene3D:1.20.930.10 SUPFAM:SSF47676 IPI:IPI00524563
            RefSeq:NP_190388.2 RefSeq:NP_850669.2 UniGene:At.19774 PRIDE:F4JCS8
            EnsemblPlants:AT3G48050.1 EnsemblPlants:AT3G48050.2 GeneID:823960
            KEGG:ath:AT3G48050 OMA:DVNTESW ArrayExpress:F4JCS8 Uniprot:F4JCS8
        Length = 1613

 Score = 3465 (1224.8 bits), Expect = 0., P = 0.
 Identities = 786/1608 (48%), Positives = 987/1608 (61%)

Query:    66 QDGRKISVGDCALFKPPQDSPPFIGIIRSLTAGKENKLTLSVNWLYRPAEVKLGKGFLLE 125
             +DGRKISVGDCALFKPPQD PPFIGIIR + A +E+KL L VNWLYRP E+KLGKG LLE
Sbjct:    46 KDGRKISVGDCALFKPPQDCPPFIGIIRLIIAEEEDKLKLGVNWLYRPTELKLGKGILLE 105

Query:   126 AAPNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGIGSFVCRKVYDITNKCLWWLTD 185
             A PNE+FYSFH+D IPAASLLHPCKVAFLP+G+ELPSGI SFVCR+VYD+TN+ LWWLTD
Sbjct:   106 AEPNELFYSFHEDNIPAASLLHPCKVAFLPRGVELPSGISSFVCRRVYDVTNERLWWLTD 165

Query:   186 QDYINERQEEVDQLLYKTHIEMHATVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPS 245
             QDYI++RQ EVD+LL KT  EMH T+  GGRSPK MN PT TSQ K   DG QNS SF S
Sbjct:   166 QDYIDDRQLEVDKLLCKTRSEMHTTLQQGGRSPKSMNSPT-TSQPK---DGIQNSNSFLS 221

Query:   246 QVKGKKRERGDQSSEPVKRERSSKMEDGNSGHSRTETNLKTEIAKITEKGGLVDYDGVEK 305
             Q KG+KRER D  SE VKRERSS+++D  SG  RTE+ LK+EI K TEKGGLVD +GVEK
Sbjct:   222 QGKGRKRERMDHGSESVKRERSSRVDDSGSGPLRTESGLKSEILKFTEKGGLVDSEGVEK 281

Query:   306 LVQLMVPERNDKKIDLVCRSLLAGVVAATDKFDCLNWFVQLRGLLVFDEWLQEVHKGKIG 365
             LVQLM+PERN+KKIDLV R++LAGVVAATDKFDCL+ FVQLRGL VFDEWLQEVHKGK+G
Sbjct:   282 LVQLMLPERNEKKIDLVGRAILAGVVAATDKFDCLSRFVQLRGLPVFDEWLQEVHKGKVG 341

Query:   366 DAGSPRDGDKSVEEFLLILLRALDKLPVNLNALQMCNIGKSVNHLRTHKNVEIQKKARSL 425
             D GSP+D D+ V++FLL+LLRALDKLPVNLNALQ CNIGKSVNHLR+HKN EI KKARSL
Sbjct:   342 DGGSPKDSDRLVDDFLLVLLRALDKLPVNLNALQTCNIGKSVNHLRSHKNSEIGKKARSL 401

Query:   426 VDTWKKRVEAEMDARP------RLPEVPHSGNRQTGASTEVAIKSLVTQPASSKTGAVKL 479
             VDTWKKRVEAEMDA+         P     G R +G S E A K+  +   +SK+ +VK 
Sbjct:   402 VDTWKKRVEAEMDAKSGSNQGVSWPGRLSHGGRHSGGSAE-ANKTSSSHLHASKSVSVKQ 460

Query:   480 CQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPSTPAKDEKXXXX 539
              Q +   K   +SP S +SAP P SG   +KDGQ RNA  A G +++ +   KDEK    
Sbjct:   461 -QVENNLKCVATSPGSTRSAPSPGSGGNVSKDGQQRNAG-AGGVSEVLAA-VKDEKSSSS 517

Query:   540 XXXXXXXXXCVGDHAKTGGLSGKEDARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAG 599
                      C  +HAKTG L GKEDARSS T   TL K S GSSR RKS N +  S+ + 
Sbjct:   518 SQSHNNSQSCSSEHAKTGNLCGKEDARSS-TAGSTLKKCSGGSSRHRKSNNVFQGSSSSA 576

Query:   600 VQRETVSSKNATLHRNSASDRPSQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNS 659
               R    S++ + HRN  S++ SQ SLT EK L+VP+ EG+  K+IVK+PNRGRSPAQ+ 
Sbjct:   577 SPRGAGLSRSFSSHRNVPSEKISQSSLTSEKTLEVPLTEGSGNKLIVKLPNRGRSPAQSV 636

Query:   660 SGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAA 719
             SGGS+ED +  NSR SSPV   KQ   D N +EKN + R ++SS +N+E WQSN  KD  
Sbjct:   637 SGGSLEDPAPVNSRVSSPVHAVKQELCDNNGREKNHSYRPNVSSVLNAESWQSNELKDIL 696

Query:   720 ACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALI 779
                 E +GSP V  DE+G    D+ +        +SL  G EFK  + H  + SSMNALI
Sbjct:   697 TGSQEAAGSPLVAGDERGGDLKDSDKASGNVKGTSSL--GNEFKSGERHGGTLSSMNALI 754

Query:   780 ESCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSDVVSP-VGSPPRTPIHEPLCDDNDS 838
             ESCV+YSE N S    DD+GMNLLASVAA EMSKS V SP V  PP + ++E     N++
Sbjct:   755 ESCVRYSETNASLAGSDDVGMNLLASVAADEMSKSPVASPSVSQPPNSVMNENSTVGNNT 814

Query:   839 RVKSFPG----DHST--DSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDL 892
             ++ +  G     H     S   E  +Q +  +     S+   +   G      ++   DL
Sbjct:   815 KLMASDGLPHEQHQAVCTSVSTEQGEQHVSSSGTQLESEIKNESKTGDRDKSSNSDTEDL 874

Query:   893 QQSGDPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLS-TE 951
             Q+  D C E+ +NS  ++ +   P  A +    +K     D+    D K  +      T 
Sbjct:   875 QRLVDQCLESNDNSDGVVASPALPTKAVK----EKILNDSDSGELKDIKTDVKSEADCTS 930

Query:   952 DKVSESTRGVETEAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVKG 1011
             D        + TE  + S    S+  +    +GV +      K+E+KP P    SE VK 
Sbjct:   931 DSTKRVASSMLTECRDVSKKVDSVAVEQTPLEGVDDD-----KKEEKP-PTALSSELVKK 984

Query:  1012 KDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKSHVNQTEEQNSEWKSNAPMIREDRVV 1071
              + ++  +SG   DM                 + +H++Q + +  +   +  +      +
Sbjct:   985 VEEDVPVSSGISRDMD-AVSIGRPITEMVNNVAFNHMDQKDVKKIKQDCDTSV----GAI 1039

Query:  1072 PHLGSAENEEKGNGKVDHRE-NLEGKEVKEELCAGPALPEVSTALRAQETGQLVRTGAVK 1130
                 +  +     GKV+  E NLE  EVKE      A P +S     +E   L R    K
Sbjct:  1040 KDTSAGLDSSVTKGKVEPVEGNLENSEVKERYSGLRATPGLSP----KEAEDLERPNGPK 1095

Query:  1131 LTISEGDKAQEXXXXXXXXXXXXVGVS---DMEAKVEFDLNEGFDGDDGKYGESSNFIVP 1187
              + ++GD+A E               S   +M+A+VEFDLNEGFDGDD K+G+S+NF   
Sbjct:  1096 TSDADGDEAGECTSAARDASSVSAAASAGSEMDARVEFDLNEGFDGDDAKHGDSNNFSGS 1155

Query:  1188 G-CSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGPFVPPEDLLRSKVELGWKGSAAT 1246
                +                               KGPFVPPEDLLR+K  +GW+GSAAT
Sbjct:  1156 VFLTPTPLQPVKTLPFPVAPVSSGTRASITVAAAAKGPFVPPEDLLRNKGAVGWRGSAAT 1215

Query:  1247 SAFRPAEPRKILEMPLGATSISVPD-STS-GKLGRPLLDIDLNVPDERVLEDLASRSSVQ 1304
             SAFRPAEPRK  ++ L   + S  D STS GK  R  LD DLNVPDERVLEDLAS+ S  
Sbjct:  1216 SAFRPAEPRKPQDVLLSINNTSTSDASTSAGKQTRTFLDFDLNVPDERVLEDLASQRSGN 1275

Query:  1305 DTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSNGNKIDVPVQ 1364
              T   SD TN+ D  R  VMGS     S GLDLDLN+ ++  D+ +Y+ ++ +++D   Q
Sbjct:  1276 PTNCTSDITNSFDQVRSGVMGSALDHSSGGLDLDLNKVDDSTDMISYTMNSSHRLDSSFQ 1335

Query:  1365 PGT--SSGGLLNGEVNVRRDFDLNDGPVLDDCSAEPS-VFPQHPRN--VSQAPVSGLRLS 1419
                  S+GG        RRDFDLNDGPV DD + EPS V  QH R+   SQ  +SG+R++
Sbjct:  1336 QVKLPSTGG--------RRDFDLNDGPVGDDAAVEPSMVLNQHSRSGLPSQPSLSGIRVN 1387

Query:  1420 SADTVNFSSWFPRGNTYSTIAVPSVLPDRGEQPFPIIAPCAPQRMLVPSTSGSPFGPDVF 1479
               +  +FS+WFP  N YS +++P ++P+RG+QPFP+IA   PQRML P+T  S F P+ +
Sbjct:  1388 GENMASFSTWFPAANAYSAVSMPPIMPERGDQPFPMIATRGPQRMLGPTTGVSSFTPEGY 1447

Query:  1480 RGXXXXXXXXXXXXXXXXQYPVFPFGTSFPLPSATFSGGTTTYVDSSSGGRFCFPAVNSQ 1539
             RG                QYPVFPFG SFP+ SA F G +T ++DSSS GR CFP VNSQ
Sbjct:  1448 RGPVLSSSPAMPFQSTTFQYPVFPFGNSFPVTSANFPGASTAHMDSSSSGRACFPGVNSQ 1507

Query:  1540 LMGPAGAVPSHFPRPYVVSLPDGXXXXXXXXXXXX-XXXXLDLNAGPGVPDIEGRDETSP 1598
             ++GP   VPS++PRPY+V LP+G                 LDLN+GPG  + EGRDE S 
Sbjct:  1508 ILGPGVPVPSNYPRPYIVGLPNGGSNGGVLDNGAKWFRSGLDLNSGPGGHETEGRDE-ST 1566

Query:  1599 LVPRQLSVAGSQVLTEDQARMYQQMAGGHFKRKEPEGGWDGYKRPSWQ 1646
             LV RQLS + S  L EDQARMYQ M+GG  KRKEPEGGWDGY++ SWQ
Sbjct:  1567 LVARQLSSSASLPLKEDQARMYQ-MSGGVLKRKEPEGGWDGYRQSSWQ 1613


>TAIR|locus:2097925 [details] [associations]
            symbol:AT3G48060 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA;ISS] [GO:0005576
            "extracellular region" evidence=ISM] [GO:0005634 "nucleus"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0016021 "integral to membrane"
            evidence=IEA] InterPro:IPR001025 InterPro:IPR003617
            InterPro:IPR017923 Pfam:PF01426 Pfam:PF08711 PROSITE:PS51038
            PROSITE:PS51319 SMART:SM00439 SMART:SM00509 GO:GO:0005634
            EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0003677 GO:GO:0006351
            EMBL:AL049658 Gene3D:1.20.930.10 SUPFAM:SSF47676 UniGene:At.19774
            IPI:IPI00534352 PIR:T06677 RefSeq:NP_190389.1 UniGene:At.48751
            ProteinModelPortal:Q9SU69 SMR:Q9SU69 STRING:Q9SU69 PaxDb:Q9SU69
            PRIDE:Q9SU69 EnsemblPlants:AT3G48060.1 GeneID:823961
            KEGG:ath:AT3G48060 TAIR:At3g48060 eggNOG:NOG298939
            HOGENOM:HOG000083288 InParanoid:Q9SU69 OMA:TERVECL PhylomeDB:Q9SU69
            ProtClustDB:CLSN2684022 Genevestigator:Q9SU69 Uniprot:Q9SU69
        Length = 1611

 Score = 3332 (1178.0 bits), Expect = 0., P = 0.
 Identities = 770/1609 (47%), Positives = 976/1609 (60%)

Query:    66 QDGRKISVGDCALFKPPQDSPPFIGIIRSLTAGKENKLTLSVNWLYRPAEVKLGKGFLLE 125
             +DGRKISVGDCALFKPPQD PPFIGIIR + A +E+KL L VNWLYRP E+KLGKG LLE
Sbjct:    46 KDGRKISVGDCALFKPPQDCPPFIGIIRLIIAEEEDKLKLGVNWLYRPTELKLGKGILLE 105

Query:   126 AAPNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGIGSFVCRKVYDITNKCLWWLTD 185
             A PNE+FYSFH+D IPAASLLHPCKVAFLP+G+ELPSGI SFVCR+VYD+TN+ LWWLTD
Sbjct:   106 AEPNELFYSFHEDNIPAASLLHPCKVAFLPRGVELPSGISSFVCRRVYDVTNERLWWLTD 165

Query:   186 QDYINERQEEVDQLLYKTHIEMHATVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPS 245
             QDYI++RQ EVD+LL KT  EMH T+  GGRSPK MN PT TSQ K   DG QN+ S  S
Sbjct:   166 QDYIDDRQLEVDKLLCKTRSEMHTTLQQGGRSPKSMNSPT-TSQPK---DGIQNNNSLFS 221

Query:   246 QVKGKKRERGDQSSEPVKRERSSKMEDGNSGHSRTETNLKTEIAKITEKGGLVDYDGVEK 305
             Q KG+KRER D  SE VKRERSS+++D  SG  RTE+ L +EI K TEKGGLVD +GVEK
Sbjct:   222 QSKGRKRERMDHGSESVKRERSSRVDDSGSGPLRTESGLTSEILKFTEKGGLVDSEGVEK 281

Query:   306 LVQLMVPERNDKKIDLVCRSLLAGVVAATDKFDCLNWFVQLRGLLVFDEWLQEVHKGKIG 365
             LVQLM+PERN+KKIDLV R++LAG VAAT++FDCL+ FVQLRGL VFDEWLQEVHKGK+G
Sbjct:   282 LVQLMLPERNEKKIDLVGRAILAGFVAATNRFDCLSRFVQLRGLPVFDEWLQEVHKGKVG 341

Query:   366 DAGSPRDGDKSVEEFLLILLRALDKLPVNLNALQMCNIGKSVNHLRTHKNVEIQKKARSL 425
             D GSP+D D+ V++FLL+LLRALDKLPVNLNALQ CNIGKSVNHLR+HKN EI KKARSL
Sbjct:   342 DGGSPKDSDRLVDDFLLVLLRALDKLPVNLNALQTCNIGKSVNHLRSHKNSEIGKKARSL 401

Query:   426 VDTWKKRVEAEMDARP------RLPEVPHSGNRQTGASTEVAIKSLVTQPASSKTGAVKL 479
             VDTWKKRVEAEMDA+         P     G R +G S E A K+  +   +SK+ +VK 
Sbjct:   402 VDTWKKRVEAEMDAKSGSNQGVSWPGRLSHGGRHSGGSAE-ANKTSSSHLHASKSVSVKQ 460

Query:   480 CQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPSTPAKDEKXXXX 539
              Q +   K   +SP S +SAP P SG   +KDGQ RNA  A G +++ +   KDEK    
Sbjct:   461 -QVENNLKCVATSPGSTRSAPSPGSGGNVSKDGQQRNAG-AGGVSEVLAA-VKDEKSSSS 517

Query:   540 XXXXXXXXXCVGDHAKTGGLSGKEDARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAG 599
                      C  +HAKTG L GKEDARSS T   TL K S GSSR RKS N +  S+ + 
Sbjct:   518 SQSHNNSQSCSSEHAKTGNLCGKEDARSS-TAGSTLKKCSGGSSRHRKSNNVFQGSSSSA 576

Query:   600 VQRETVSSKNATLHRNSASDRPSQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNS 659
               R    S++ + HRN  S++ SQ SLT EK L+VP+ EG+  K+IVK+P RGRSPAQ+ 
Sbjct:   577 SPRGAGLSRSFSSHRNVPSEKISQSSLTSEKTLEVPLTEGSGNKLIVKLP-RGRSPAQSV 635

Query:   660 SGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAA 719
             SGGS+ED +  NSR SSPV   KQ   D N++EKN + RAD+SS +N+E WQSN  KD  
Sbjct:   636 SGGSLEDPAPVNSRVSSPVHTVKQELCDNNWREKNHSYRADVSSVLNAESWQSNELKDIL 695

Query:   720 ACPDEGSGSPAVLP-DEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNAL 778
                 E +GSP V+  DE+     D+ +        +SL  G EFK  + H  + SSMNAL
Sbjct:   696 TGSQEATGSPLVVAGDEREGALKDSDKASGNVKATSSL--GNEFKSGERHGGTLSSMNAL 753

Query:   779 IESCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSDVVSP-VGSPPRTPIHEPLCDDND 837
             IESCV+YSE N S    DD+GMNLLASVAA EMSKS V SP V  PP + ++E     N+
Sbjct:   754 IESCVRYSETNASLAGSDDVGMNLLASVAADEMSKSPVASPSVSQPPNSLMNENSTVGNN 813

Query:   838 SRVKSFPG----DHST--DSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVD 891
             +++ +  G     H     +  +E  +Q +  +     S+   +   G      ++   D
Sbjct:   814 TKLMASDGLPHKQHQAVRPTLSNEQGEQHVSSSGTQLESEIKNESKTGDRVKSSNSDTED 873

Query:   892 LQQSGDPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLS-T 950
             LQ+  D   E+ ENS  ++ +   P    +    D +    D+    D K  +      T
Sbjct:   874 LQRFVDQRLESNENSDGVVASPPLPTKVIKENILDDS----DSGEVKDIKTDVKSEADCT 929

Query:   951 EDKVSESTRGVETEAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVK 1010
              D        + TE  + S    S+  +    +GV +      K+E+KP P    SE VK
Sbjct:   930 SDLTKRVASSMLTECRDVSKMVDSVAVEHTPLEGVDDD-----KKEEKP-PTALSSELVK 983

Query:  1011 GKDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKSHVNQTEEQNSEWKSNAPMIREDRV 1070
               + ++  +SG    M                 + +H++Q + +  +   +  +      
Sbjct:   984 KVEEDVPVSSGISRGMD-AVSIDRPITEMVNNIAFNHMDQKDIKKIKQDFDTSV----GA 1038

Query:  1071 VPHLGSAENEEKGNGKVDHRE-NLEGKEVKEELCAGPALPEVSTALRAQETGQLVRTGAV 1129
             V    +  +     GKV+  E NLE  E+ E      A P +S     +E   L R  A 
Sbjct:  1039 VKDASAGLDSSVTKGKVEPVEGNLENIEIMERYSGLRATPGLSP----KEAEDLKRPNAP 1094

Query:  1130 KLTISEGDKAQEXXXXXXXXXXXXVGVS-----DMEAKVEFDLNEGFDGDDGKYGESSNF 1184
             K + ++GD+A E               S     +M+A+VEFDLNEGFDGDD ++G+S+NF
Sbjct:  1095 KTSDADGDEAGECTSAARDASSVSAAASASAGSEMDARVEFDLNEGFDGDDAQHGDSNNF 1154

Query:  1185 ---IVPGCSGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKGPFVPPEDLLRSKVELGWK 1241
                +V   +                               KGPFVPPEDLLR K  +GW+
Sbjct:  1155 SGSVV--LTPTPLQPVNTLPFPVAPVSSGIPASITVAAAVKGPFVPPEDLLRYKGAVGWR 1212

Query:  1242 GSAATSAFRPAEPRKILEMPLGATSISVPD-STS-GKLGRPLLDIDLNVPDERVLEDLAS 1299
             GSAATSAFRPAEPRK  ++ L   + S  D STS GK  R  LD DLNVPDERVLEDLAS
Sbjct:  1213 GSAATSAFRPAEPRKAQDVLLSINNTSTSDASTSAGKQTRTFLDFDLNVPDERVLEDLAS 1272

Query:  1300 RSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSNGNKI 1359
             + S   T   S  TNN D  R  VMGS     S GLDL  N+ ++L D+ +Y+ ++ +++
Sbjct:  1273 QRSGNPTNCTSGITNNFDQVRSGVMGSALDHSSGGLDL--NKVDDLTDMNSYTMNSSHRL 1330

Query:  1360 DVPVQPGT--SSGGLLNGEVNVRRDFDLNDGPVLDDCSAEPS-VFPQHPRN--VSQAPVS 1414
             D   Q     S+GG        RRDFDLNDGPV DD + EPS V  QH R+   SQ  +S
Sbjct:  1331 DSSFQQVKLPSTGG--------RRDFDLNDGPVGDDAAVEPSMVLNQHSRSGLPSQPSLS 1382

Query:  1415 GLRLSSADTVNFSSWFPRGNTYSTIAVPSVLPDRGEQPFPIIAPCAPQRMLVPSTSGSPF 1474
             G+R++  +  +FS+WFP  N YS +++P ++P+RG+QPFP+IA   PQRML P+T  S F
Sbjct:  1383 GIRVNGENMASFSTWFPAANAYSAVSMPPIMPERGDQPFPMIATRGPQRMLGPTTGVSSF 1442

Query:  1475 GPDVFRGXXXXXXXXXXXXXXXXQYPVFPFGTSFPLPSATFSGGTTTYVDSSSGGRFCFP 1534
              P+ +RG                QYPVFPFG SFP+  A F G +T ++DSSS GR  FP
Sbjct:  1443 SPEGYRGPVLSSSPAMPFQSTTFQYPVFPFGNSFPVTPANFPGASTAHMDSSSSGRAYFP 1502

Query:  1535 AVNSQLMGPAGAVPSHFPRPYVVSLPDGXXXXXXXXXXXX-XXXXLDLNAGPGVPDIEGR 1593
              VNSQ++GP   VPS++PRPY+V LP+G                 LDLN+GPG  + EGR
Sbjct:  1503 GVNSQILGPGVPVPSNYPRPYIVGLPNGGSNGGVLDNSAKWFRSGLDLNSGPGGHETEGR 1562

Query:  1594 DETSPLVPRQLSVAGSQVLTEDQARMYQQMAGGHFKRKEPEGGWDGYKR 1642
             DE S LV RQLS + S    EDQARMYQ M+GG  KRKEPEGGWDGY++
Sbjct:  1563 DE-STLVSRQLSSSASVPSKEDQARMYQ-MSGGVLKRKEPEGGWDGYRQ 1609


>TAIR|locus:2123136 [details] [associations]
            symbol:AT4G11560 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0000956 "nuclear-transcribed mRNA catabolic
            process" evidence=RCA] [GO:0009630 "gravitropism" evidence=RCA]
            InterPro:IPR001025 InterPro:IPR003618 Pfam:PF01426 Pfam:PF07500
            PROSITE:PS51038 PROSITE:PS51321 SMART:SM00439 EMBL:CP002687
            GenomeReviews:CT486007_GR GO:GO:0003677 GO:GO:0006351
            Gene3D:1.10.472.30 SUPFAM:SSF46942 HOGENOM:HOG000237923
            EMBL:AY080679 EMBL:AY133747 IPI:IPI00528925 RefSeq:NP_192893.2
            UniGene:At.44003 ProteinModelPortal:Q8RXT5 PaxDb:Q8RXT5
            PRIDE:Q8RXT5 EnsemblPlants:AT4G11560.1 GeneID:826760
            KEGG:ath:AT4G11560 TAIR:At4g11560 eggNOG:NOG282987
            InParanoid:Q8RXT5 OMA:KELDPPV PhylomeDB:Q8RXT5
            ProtClustDB:CLSN2690291 Genevestigator:Q8RXT5 Uniprot:Q8RXT5
        Length = 587

 Score = 264 (98.0 bits), Expect = 6.1e-19, P = 6.1e-19
 Identities = 56/141 (39%), Positives = 79/141 (56%)

Query:    67 DGRKISVGDCALFKPPQDSP-PFIGIIRSLTAGKENKLTLSVNWLYRPAEV-KLGKGFLL 124
             DG    +    L  P   S  P++ II+ +T  K+  + +   W YRP E  K G G   
Sbjct:   127 DGNTYDLEVPVLLVPEDKSQKPYVAIIKDITQTKDGSMMILGQWFYRPEEAEKRGGGNWQ 186

Query:   125 EAAPNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGIGS--FVCRKVYDITNKCLWW 182
              +   E+FYSFH+DE+PA S++H C V F+P   +LP    +  F+ RKVYD   K LW 
Sbjct:   187 SSDTRELFYSFHRDEVPAESVMHRCVVYFVPAHKQLPKRKNNPGFIVRKVYDTVEKKLWK 246

Query:   183 LTDQDYINERQEEVDQLLYKT 203
             LTD+DY + +Q E+D L+ KT
Sbjct:   247 LTDKDYEDSKQREIDVLVKKT 267


>TAIR|locus:2135907 [details] [associations]
            symbol:AT4G24200 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0005634 "nucleus"
            evidence=ISM;IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0005739 "mitochondrion" evidence=IDA]
            InterPro:IPR017923 PROSITE:PS51319 GO:GO:0005739 GO:GO:0005634
            EMBL:CP002687 GO:GO:0003677 GO:GO:0006351 GO:GO:0003746
            Gene3D:1.20.930.10 SUPFAM:SSF47676 EMBL:AY064014 EMBL:AY096636
            IPI:IPI00525810 RefSeq:NP_194151.2 UniGene:At.22816
            ProteinModelPortal:Q8VZM2 SMR:Q8VZM2 PaxDb:Q8VZM2 PRIDE:Q8VZM2
            EnsemblPlants:AT4G24200.1 GeneID:828521 KEGG:ath:AT4G24200
            eggNOG:NOG244511 InParanoid:Q8VZM2 OMA:TKNEECL
            ProtClustDB:CLSN2714365 Genevestigator:Q8VZM2 Uniprot:Q8VZM2
        Length = 1000

 Score = 161 (61.7 bits), Expect = 6.1e-13, Sum P(4) = 6.1e-13
 Identities = 63/199 (31%), Positives = 97/199 (48%)

Query:   291 ITE-KGGLVDYDGVEKLVQLMVPERND--KKIDLVCR--SLLAGVVAATDKFDCLNWFVQ 345
             +TE K GL     VE+LV +M   ++   K      R  + +A  +AAT   DCL+ FV 
Sbjct:     9 LTEIKDGLTVTSRVEELVSVMQSNKDSVLKNSGDAFRQWTAVASTIAATKNRDCLDVFVN 68

Query:   346 LRGLLVFDEWLQEVHKGKIGDAGSPRDGDKSVEEFLLILLRALDKLPVNLNALQMCNIGK 405
             L GL+    WL E    ++ D  S    D+SVEE +L LL A++ L V+ + L    +  
Sbjct:    69 LDGLVYLSSWLAEA---QMLDNDSV---DRSVEESILALLEAVENLGVDSSKLVSSGLWV 122

Query:   406 SVNHLRTHKNVEIQKKARSLVDTWKKRVE---AEMDARPRLPEVPHSGNRQTGASTEVA- 461
             +V  L  H +  +Q +AR L  +WK + +   +E D+     ++     R   AS E + 
Sbjct:   123 AVKKLVDHGSSRVQDQARKLFGSWKDKDDYDHSEHDSES-CNKIHEDEMRVVAASIESSG 181

Query:   462 IKSLVTQPASSKTGAVKLC 480
              KS VT   S ++ + K C
Sbjct:   182 QKSAVTL-CSMQSTSEKHC 199

 Score = 77 (32.2 bits), Expect = 6.1e-13, Sum P(4) = 6.1e-13
 Identities = 57/195 (29%), Positives = 80/195 (41%)

Query:  1224 PFVPPEDLLRSKVELGWKGSAATSAFRPAEPRKILEMPLGATSISVPDSTSGKLGRPLLD 1283
             P   P  L RS   L  KGSAATS F  A P K+    L    +         + R    
Sbjct:   621 PAAAPWHLERS---LSGKGSAATSVFHSALPHKVPSGDLREKQV---------ISR---G 665

Query:  1284 IDLNVP---DERVLEDLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLD-L 1339
             IDLNV    D++V EDL      Q   ++S+           + GS        LDL+ +
Sbjct:   666 IDLNVAEVGDDQV-EDLTPWK--QFPFSSSNSRGGESSHEASLRGSSKFN----LDLNCM 718

Query:  1340 NRAEEL-------IDIGNYSTSNGNKIDVPVQPGTSSGGLLNG-EVNVRRDFDLNDGP-V 1390
             N  +E+       ++   + + NG +   PV   +SS    +G EVN    FDLND P  
Sbjct:   719 NEDDEMPPPSESKMETRLFLSHNGQQSASPVS--SSSVAQQSGKEVN----FDLNDRPQF 772

Query:  1391 LDDCSAEPSVFPQHP 1405
               D   +   + +HP
Sbjct:   773 FIDSRDQGPYYGRHP 787

 Score = 64 (27.6 bits), Expect = 6.1e-13, Sum P(4) = 6.1e-13
 Identities = 30/133 (22%), Positives = 55/133 (41%)

Query:   845 GDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLT---GHISTSPV----DLQQSGD 897
             G+ S  S+++  ++ G   + ++++ D +   P+ GL+    H    P     DL    D
Sbjct:   495 GEPSHSSSEELSDESGQSGSQYSRDDDVHTGSPSKGLSVTENHSFEEPHVGDDDLMDEKD 554

Query:   898 PCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSES 957
                E+ +  +E  +A       GR  E+   GF ++ D  PD    I    ST       
Sbjct:   555 DKPESGD-VEERHLATAAKSEVGR--EKSPCGFDLNQDICPDETDVIMSSTSTTPAPMSV 611

Query:   958 TRGVETEAVEGSA 970
             +  V + A+  +A
Sbjct:   612 SFSVSSSAMPAAA 624

 Score = 61 (26.5 bits), Expect = 6.1e-13, Sum P(4) = 6.1e-13
 Identities = 39/149 (26%), Positives = 57/149 (38%)

Query:  1503 PFGTSFPLPSATFSGGTTTYVDSSSGGRFCFPAVNSQLMGPAGAVPSHFPRPYV-VSLPD 1561
             P G S   P     G    Y+  S G     P +  Q++G A  V   FP+ ++ +SL  
Sbjct:   868 PPGLSMSSPMYV-PGTAIPYMVDSRGT----PVMMPQIIGSAPYVQPPFPQQHMFMSLAG 922

Query:  1562 GXXXXXXXXXXXXXXXXLDLNAGPGVPDIEGRDETSP---LVPRQLSVAGSQVLTEDQAR 1618
             G                 D N+G G+ +I  R+  +    L P Q    G       +  
Sbjct:   923 GSPSTNGSMRPN-----FDQNSGFGL-EIGNRESLNLRQFLSPSQSGAMGEHSGANVEPS 976

Query:  1619 MYQQMA-GGHFKRKEPEGGWDGYKRPSWQ 1646
                 ++ GG  KRKEPE  W+    P W+
Sbjct:   977 SSSSISIGG--KRKEPEPRWEF---PPWR 1000

 Score = 40 (19.1 bits), Expect = 2.8e-09, Sum P(4) = 2.8e-09
 Identities = 14/43 (32%), Positives = 21/43 (48%)

Query:  1369 SGGLLNGEVNVRRDFDLNDGPVLDDCSAEPSVFPQHPRNVSQA 1411
             SG L   +V + R  DLN   V DD   + + + Q P + S +
Sbjct:   653 SGDLREKQV-ISRGIDLNVAEVGDDQVEDLTPWKQFPFSSSNS 694

 Score = 39 (18.8 bits), Expect = 2.9e-07, Sum P(3) = 2.9e-07
 Identities = 10/32 (31%), Positives = 17/32 (53%)

Query:   559 LSGKEDARSSATVSMTLNKISVGSSRSRKSVN 590
             LSGK  A +S   S   +K+  G  R ++ ++
Sbjct:   632 LSGKGSAATSVFHSALPHKVPSGDLREKQVIS 663


>TAIR|locus:2040209 [details] [associations]
            symbol:AT2G25120 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR001025 Pfam:PF01426 PROSITE:PS51038 SMART:SM00439
            EMBL:CP002685 GO:GO:0003677 IPI:IPI00531195 RefSeq:NP_180084.1
            UniGene:At.52905 ProteinModelPortal:F4IRK4
            EnsemblPlants:AT2G25120.1 GeneID:817050 KEGG:ath:AT2G25120
            OMA:KKDCEEK PhylomeDB:F4IRK4 Uniprot:F4IRK4
        Length = 380

 Score = 195 (73.7 bits), Expect = 6.8e-12, P = 6.8e-12
 Identities = 61/222 (27%), Positives = 103/222 (46%)

Query:    68 GRKISVGDCALFKPPQ-DSPPFIGIIRSL-TAGKENKLTLSVNWLYRPAEV-KLGKGFLL 124
             G + ++ D     P   +S P+  II+ +    KE  + L+V+W YRP +V K   G   
Sbjct:    93 GNQYALEDSVQLVPDDPNSKPYCAIIKDIYIPNKEKYVKLAVHWFYRPEDVDKKHVGKWE 152

Query:   125 EAAPNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGIGS--FVCRKVYDITNKCLWW 182
                   +FYSFH+DE+ A S+ H C V F+P+  ++P+      F+ + VYD   K +  
Sbjct:   153 SKDSRNLFYSFHRDEVFAESVKHKCVVNFVPENKQIPNRREHPCFIVQNVYDFVKKKVRK 212

Query:   183 LTDQDYINERQEEVDQLLYKTHIEMHATVPSGGRSP--KPMNGPTSTSQLKPGSDGAQNS 240
              TD+++   ++ E+D+L+ KT + +   +P   +    K   G  +  +  P +     S
Sbjct:   213 FTDKNFDVHQKNEIDRLVAKTSLRL-GDLPDIEKDQVTKTSKGKRTVQRKSPKTSTVYKS 271

Query:   241 A--SFPSQVKGKKRE-RGDQSSEPVKRE-RSSKMEDGNSGHS 278
                 F        R+ R  +  E VK E R+SK +      S
Sbjct:   272 ILEDFDLLTGDSDRDKRLGELLEAVKHECRTSKKKGARDDDS 313


>TAIR|locus:2121601 [details] [associations]
            symbol:AT4G23120 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] InterPro:IPR001025 Pfam:PF01426 PROSITE:PS51038
            SMART:SM00439 EMBL:CP002687 GenomeReviews:CT486007_GR GO:GO:0003677
            EMBL:AL031018 EMBL:AL161558 EMBL:AL022347 IPI:IPI00520892
            PIR:T04829 RefSeq:NP_194043.1 UniGene:At.54486
            ProteinModelPortal:O65464 SMR:O65464 PaxDb:O65464
            EnsemblPlants:AT4G23120.1 GeneID:828411 KEGG:ath:AT4G23120
            TAIR:At4g23120 eggNOG:NOG239021 HOGENOM:HOG000237923
            InParanoid:O65464 OMA:KLEWRPE PhylomeDB:O65464
            ProtClustDB:CLSN2683146 ArrayExpress:O65464 Genevestigator:O65464
            Uniprot:O65464
        Length = 360

 Score = 185 (70.2 bits), Expect = 3.4e-10, Sum P(2) = 3.4e-10
 Identities = 46/138 (33%), Positives = 70/138 (50%)

Query:    70 KISVGDCALFKPPQDSPPFIGIIRSL-TAGKENKLTLSVNWLYRPAEV-KLGKGFLLEAA 127
             K  + D  L  P     P++ II+ + T  KE  + L V WLYRP EV K   G      
Sbjct:    58 KYGLEDSVLLVPEDGEKPYVAIIKDIYTQRKEGHVKLEVQWLYRPEEVEKKYVGNWKSKG 117

Query:   128 PNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGIG--SFVCRKVYDITNKCLWWLTD 185
               ++FYSFH+DE+ A S+   C V F+ +  ++P+      F+ + VYD   K L  LT 
Sbjct:   118 SRDLFYSFHRDEVFAESVKDDCIVHFVQENKQIPNRRKHPGFIVQHVYDNVKKKLRKLTF 177

Query:   186 QDYINERQEEVDQLLYKT 203
               +  +++ E+D  + KT
Sbjct:   178 NGFDLQQKREIDHFVEKT 195

 Score = 40 (19.1 bits), Expect = 3.4e-10, Sum P(2) = 3.4e-10
 Identities = 9/37 (24%), Positives = 19/37 (51%)

Query:   745 RKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIES 781
             +K +      ++P  Y  K V+    S + +N+++ES
Sbjct:   209 QKTLISRSKRTVPQSYIIKAVETSRESNNVVNSILES 245


>TAIR|locus:2097134 [details] [associations]
            symbol:AT3G43990 species:3702 "Arabidopsis thaliana"
            [GO:0003677 "DNA binding" evidence=IEA;ISS] [GO:0005634 "nucleus"
            evidence=ISM] [GO:0008150 "biological_process" evidence=ND]
            InterPro:IPR001025 InterPro:IPR003618 Pfam:PF01426 PROSITE:PS51038
            SMART:SM00439 EMBL:CP002686 GenomeReviews:BA000014_GR GO:GO:0003677
            GO:GO:0006351 Gene3D:1.10.472.30 EMBL:AL163975 HOGENOM:HOG000237923
            ProtClustDB:CLSN2683146 IPI:IPI00530281 PIR:T48953
            RefSeq:NP_189985.1 UniGene:At.53714 ProteinModelPortal:Q9LXW1
            SMR:Q9LXW1 PaxDb:Q9LXW1 PRIDE:Q9LXW1 EnsemblPlants:AT3G43990.1
            GeneID:823516 KEGG:ath:AT3G43990 TAIR:At3g43990 eggNOG:NOG279418
            OMA:KCERMSS PhylomeDB:Q9LXW1 Genevestigator:Q9LXW1 Uniprot:Q9LXW1
        Length = 380

 Score = 163 (62.4 bits), Expect = 2.4e-08, P = 2.4e-08
 Identities = 43/143 (30%), Positives = 76/143 (53%)

Query:    68 GRKISVGDCALFKPPQDSPP-FIGIIRSL-TAGKENKLTLSVNWLYRPAEVK---LGKGF 122
             G++  + D  L  P  +    +I II+ + +  K+  + + V W YR  +++    GK +
Sbjct:    81 GKQYRLKDFVLLYPEDNKQKEYIAIIKDIYSQEKDGLVKMEVQWFYRREDIEEKHFGK-W 139

Query:   123 LLEAAPNEIFYSFHKDEIPAASLLHPCKVAFLPKGIELPSGI--GSFVCRKVYDITNKCL 180
               E  P EIF+SFH DE+ A S+ + C V F+P   ++P+ I    F+ + VYD   K +
Sbjct:   140 KTEN-PREIFFSFHCDEVFAESVKYKCLVYFVPDDKQIPNRIHHSGFIVQMVYDNVRKKV 198

Query:   181 WWLTDQDYINERQEEVDQLLYKT 203
                + + +  E++ E+D L+ KT
Sbjct:   199 RKFSHEGFEEEQKFEIDMLVAKT 221


>DICTYBASE|DDB_G0268640 [details] [associations]
            symbol:DDB_G0268640 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0005575 "cellular_component" evidence=ND]
            [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0268640 EMBL:AAFI02000004 eggNOG:NOG12793
            RefSeq:XP_646824.1 EnsemblProtists:DDB0233766 GeneID:8616507
            KEGG:ddi:DDB_G0268640 InParanoid:Q55F46 OMA:HSTSEVS Uniprot:Q55F46
        Length = 784

 Score = 149 (57.5 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
 Identities = 134/683 (19%), Positives = 233/683 (34%)

Query:   472 SKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPSTPA 531
             S   A ++  G + + S  SS  S  ++ +P S S  A +    +  S+   +++PS   
Sbjct:    59 SSHSAPEVPTGSSHSTSEVSSDSSNSASKVPTSSSHSASEA---STGSSHSESEVPSGST 115

Query:   532 KDEKXXXXXXXXXXXXXCVGDHAKTGGLS-GKEDARSSATV--SMTLNKISVGSSRSRKS 588
                               +G    T  +S G   + S      S + +++S GSS S   
Sbjct:   116 HSSSEVSTGSSNSASEVSIGSSHSTSEVSTGSSHSTSEVPSGSSHSTSEVSTGSSHSASE 175

Query:   589 VN-GYPSST---PAGVQRETVSSKNATLHRNSASDRPSQPSLTCEKALDVPVVEGANPKI 644
             V+ G   ST   P G      SS+  T   +S+S+ P+  S     + +VP     +   
Sbjct:   176 VSIGSSHSTSEVPTGSSHS--SSEVPTGSSHSSSEVPTGSS---HSSSEVPTGSSHSSSE 230

Query:   645 IVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFK--EKNDALRADIS 702
             +    +   S     S  S  +    +S ++S V P   +           + A      
Sbjct:   231 VPTGSSHSASEVPTGSSNSASEVPSDSSHSASEV-PSGSSHSASEVPTGSSHSASEVPTG 289

Query:   703 SNMNSEPWQSNVNKDAAACPDEGSGSPAVLP---DEQGSKTGDNCRKVVEDLEDNSLPPG 759
             S+ +S    S+ +  A+  P   S S + +P       S+          ++  +S    
Sbjct:   290 SSHSSSEVPSDSSNSASEVPTGSSHSNSEVPTGSSHSASEVSTGSSHSASEVSTSSSLSA 349

Query:   760 YEFKDVKLH---ESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSDV 816
              E      H   E S  S N+  E     S +    P G     +  + V  G    +  
Sbjct:   350 SEVSAGSSHSASEVSAGSSNSASEVPTGSSHSKSEVPNGSS---HSASEVPIGSSHSASE 406

Query:   817 VSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDK 876
             VS   S   + +             S     S  ST   H   G + +  + NS S    
Sbjct:   407 VSTSSSHSASEVPNGSSHSRSEVSTSSSNSGSEVSTSSSHS--GSEVSTSSSNSASEVST 464

Query:   877 PAGGLTGHISTSPVDLQQSGDPCQENTENSKEIIV-----AEETPDGAGRNPEEDKAGFR 931
              +   T  +STS  +   + +    ++ ++ E++      A E P G+  +  E      
Sbjct:   465 GSSRSTSEVSTSSSN--SASEVLSGSSNSASEVLTGSSNSASEVPTGSSNSASEVPTDSS 522

Query:   932 VDADGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDGENKKGVSE---- 987
               A   P G    +  + T    S S   V T +   ++SN   E    +   V+E    
Sbjct:   523 NSASEVPTGSSNSASEVPTGS--SNSVTEVPTGSSNSASSNSVSEVPTGSSNSVTEVSTT 580

Query:   988 GLNSGVKREQKPSPITTHSESVKGKDGELLH--TSGSGEDMPLXXXXXXXXXXXXXXXSK 1045
               +S        S ++T S S  G +G      +S SG +  +               S 
Sbjct:   581 SSHSASGSSHSTSEVSTGS-SQSGSEGSTGSNGSSHSGSEGSIGSGSLNSGSVSHNSDSG 639

Query:  1046 SHVNQTEEQNSEWKSNAPMIREDRVVPHLGSAE-NEEKGNGKVDHRENLEGKEVKEELCA 1104
             S  +    Q+    SN     +      +GS   N   G+   D   +L+GK      C 
Sbjct:   640 SEDSNGSSQSGSEVSNGSS--QSVSEGSIGSGSFNSGSGSHNSDSGSSLDGKTT----CI 693

Query:  1105 GPALPEVSTALRAQETGQLVRTG 1127
                  + + +++  ETG++  TG
Sbjct:   694 SY---QDTISIKTLETGEIECTG 713

 Score = 145 (56.1 bits), Expect = 3.5e-07, Sum P(2) = 3.5e-07
 Identities = 123/606 (20%), Positives = 226/606 (37%)

Query:   444 PEVPHSGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPA 503
             PEVP   +  T   +  +  S    P SS   A +   G + ++S   S  +  S+ +  
Sbjct:    64 PEVPTGSSHSTSEVSSDSSNSASKVPTSSSHSASEASTGSSHSESEVPSGSTHSSSEVST 123

Query:   504 SGSTDAKD---GQPRNAAS-ATG----TTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAK 555
               S  A +   G   + +  +TG    T+++PS  +                  +G    
Sbjct:   124 GSSNSASEVSIGSSHSTSEVSTGSSHSTSEVPSGSSHSTSEVSTGSSHSASEVSIGSSHS 183

Query:   556 TGGL-SGKEDARSSATV--SMTLNKISVGSSRSRKSV---NGYPSS-TPAGVQRETVSSK 608
             T  + +G   + S      S + +++  GSS S   V   + + SS  P G      +S+
Sbjct:   184 TSEVPTGSSHSSSEVPTGSSHSSSEVPTGSSHSSSEVPTGSSHSSSEVPTGSSHS--ASE 241

Query:   609 NATLHRNSASDRPSQPSLTCEKALDVPVVEGANPKIIVKIPN-RGRSPAQNSSGGSVEDT 667
               T   NSAS+ PS  S     A +VP   G++     ++P     S ++  +G S   +
Sbjct:   242 VPTGSSNSASEVPSDSS---HSASEVP--SGSSHSAS-EVPTGSSHSASEVPTGSSHSSS 295

Query:   668 SV-TNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGS 726
              V ++S  S+  +P   +  +      +    +++S+  +    + + +   +A      
Sbjct:   296 EVPSDSSNSASEVPTGSSHSNSEVPTGSSHSASEVSTGSSHSASEVSTSSSLSASEVSAG 355

Query:   727 GSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPG--YEFKDVKLHESSFSSMNALIESCVK 784
              S +      GS    +          + +P G  +   +V +  SS S+      S   
Sbjct:   356 SSHSASEVSAGSSNSASEVPTGSSHSKSEVPNGSSHSASEVPIG-SSHSASEVSTSSSHS 414

Query:   785 YSEA-NVSAPAGDDIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSF 843
              SE  N S+ +  ++  +  +S +  E+S S   S  GS   T       + +    +S 
Sbjct:   415 ASEVPNGSSHSRSEVSTS--SSNSGSEVSTSS--SHSGSEVSTSSSNSASEVSTGSSRS- 469

Query:   844 PGDHSTDSTDDEHEKQGIDRNLWAK----NSDSNQDKPAGGLTGHISTSPVDLQQSGDPC 899
               + ST S++   E      N  ++    +S+S  + P G  +   S  P D   S    
Sbjct:   470 TSEVSTSSSNSASEVLSGSSNSASEVLTGSSNSASEVPTGS-SNSASEVPTDSSNSASEV 528

Query:   900 QENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTR 959
                + NS     A E P G+  +  E   G    A    +    +  P  + + V+E + 
Sbjct:   529 PTGSSNS-----ASEVPTGSSNSVTEVPTGSSNSASS--NSVSEV--PTGSSNSVTEVST 579

Query:   960 GVETEAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVKGKDGELLHT 1019
                + +  GS+ + S    G ++ G SEG           S  +  S S+    G + H 
Sbjct:   580 -TSSHSASGSSHSTSEVSTGSSQSG-SEGSTGSNGSSHSGSEGSIGSGSLNS--GSVSHN 635

Query:  1020 SGSGED 1025
             S SG +
Sbjct:   636 SDSGSE 641

 Score = 121 (47.7 bits), Expect = 0.00011, Sum P(2) = 0.00011
 Identities = 116/582 (19%), Positives = 202/582 (34%)

Query:   471 SSKTGAV-KLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPST 529
             S K G+   +    + ++S+  S  S    P  +S ST        N+AS   T+   S 
Sbjct:    37 SLKPGSEGSISSSQSGSESSRGSSHSAPEVPTGSSHSTSEVSSDSSNSASKVPTSS--SH 94

Query:   530 PAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSSATV--SMTLNKISVGSSRSRK 587
              A +                   H+ +   +G  ++ S  ++  S + +++S GSS S  
Sbjct:    95 SASEASTGSSHSESEVPSGST--HSSSEVSTGSSNSASEVSIGSSHSTSEVSTGSSHSTS 152

Query:   588 SVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQ-PSLTCEKALDVPVVEGANPKIIV 646
              V    S + + V   T SS +A+     +S   S+ P+ +   + +VP     +   + 
Sbjct:   153 EVPSGSSHSTSEVS--TGSSHSASEVSIGSSHSTSEVPTGSSHSSSEVPTGSSHSSSEVP 210

Query:   647 KIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKN-DALRADISSNM 705
                +   S     S  S  +    +S ++S V     N       + +  A      S+ 
Sbjct:   211 TGSSHSSSEVPTGSSHSSSEVPTGSSHSASEVPTGSSNSASEVPSDSSHSASEVPSGSSH 270

Query:   706 NSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDV 765
             ++    +  +  A+  P   S S + +P +  +   +          ++ +P G      
Sbjct:   271 SASEVPTGSSHSASEVPTGSSHSSSEVPSDSSNSASEV--PTGSSHSNSEVPTGSSHSAS 328

Query:   766 KLHE-SSFSSMNALIESCVKYSEANV-SAPAGDDIGM---NLLASVAAGEM-SKSDVVSP 819
             ++   SS S+      S +  SE +  S+ +  ++     N  + V  G   SKS+V  P
Sbjct:   329 EVSTGSSHSASEVSTSSSLSASEVSAGSSHSASEVSAGSSNSASEVPTGSSHSKSEV--P 386

Query:   820 VGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDD--EHEKQGIDRNLWAKNSDSNQDKP 877
              GS   +    P+   + +   S    HS     +   H +  +  +  + NS S     
Sbjct:   387 NGSS-HSASEVPIGSSHSASEVSTSSSHSASEVPNGSSHSRSEVSTS--SSNSGSEVSTS 443

Query:   878 AGGLTGHISTSP------VDLQQSGDPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFR 931
             +      +STS       V    S    + +T +S     A E   G+  +  E   G  
Sbjct:   444 SSHSGSEVSTSSSNSASEVSTGSSRSTSEVSTSSSNS---ASEVLSGSSNSASEVLTGSS 500

Query:   932 VDADGAPDGKQRISG--PLSTEDKVSESTRGVETEAVE---GSASNQSLEFDGENKKGVS 986
               A   P G    +   P  + +  SE   G    A E   GS SN   E    +    S
Sbjct:   501 NSASEVPTGSSNSASEVPTDSSNSASEVPTGSSNSASEVPTGS-SNSVTEVPTGSSNSAS 559

Query:   987 EGLNSGVKREQKPSPI---TTHSESVKGKDGELLHTS-GSGE 1024
                 S V      S     TT S S  G        S GS +
Sbjct:   560 SNSVSEVPTGSSNSVTEVSTTSSHSASGSSHSTSEVSTGSSQ 601

 Score = 65 (27.9 bits), Expect = 1.3e-07, Sum P(2) = 1.3e-07
 Identities = 24/75 (32%), Positives = 35/75 (46%)

Query:   214 GGRSPKPMNGPTST-SQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEP-VKRERSSKME 271
             G  S    N P+S+   LKPGS+G+ +S    SQ  G +  RG   S P V    S    
Sbjct:    21 GSVSSSSNNSPSSSLDSLKPGSEGSISS----SQ-SGSESSRGSSHSAPEVPTGSSHSTS 75

Query:   272 DGNSGHSRTETNLKT 286
             + +S  S + + + T
Sbjct:    76 EVSSDSSNSASKVPT 90


>FB|FBgn0036203 [details] [associations]
            symbol:Muc68D "Mucin 68D" species:7227 "Drosophila
            melanogaster" [GO:0016490 "structural constituent of peritrophic
            membrane" evidence=ISS] [GO:0005576 "extracellular region"
            evidence=IEA] [GO:0008061 "chitin binding" evidence=IEA]
            [GO:0006030 "chitin metabolic process" evidence=IEA] [GO:0031012
            "extracellular matrix" evidence=ISM] [GO:0005201 "extracellular
            matrix structural constituent" evidence=ISM] InterPro:IPR002557
            Pfam:PF01607 PROSITE:PS50940 SMART:SM00494 GO:GO:0005576
            EMBL:AE014296 eggNOG:NOG12793 GO:GO:0031012 GO:GO:0008061
            GO:GO:0005201 CAZy:CBM14 Gene3D:2.170.140.10 SUPFAM:SSF57625
            GO:GO:0006030 GeneTree:ENSGT00700000104174 EMBL:AY075323
            RefSeq:NP_648504.2 UniGene:Dm.20068 SMR:Q9VTN2 MINT:MINT-900668
            STRING:Q9VTN2 EnsemblMetazoa:FBtr0076119 GeneID:39326
            KEGG:dme:Dmel_CG6004 UCSC:CG6004-RB CTD:39326 FlyBase:FBgn0036203
            InParanoid:Q9VTN2 OMA:STESSQD OrthoDB:EOG4WSTSF GenomeRNAi:39326
            NextBio:813085 Uniprot:Q9VTN2
        Length = 1514

 Score = 187 (70.9 bits), Expect = 1.7e-07, Sum P(3) = 1.7e-07
 Identities = 124/672 (18%), Positives = 239/672 (35%)

Query:   446 VPHSGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSS---PVSVKSAPLP 502
             +P+S  + + +STE + ++  T  A+ ++ + +  Q D+ T+ + SS   P+S +S+   
Sbjct:   332 LPNSSTQDSSSSTETSFQTESTTDATDESSSTE-SQPDSTTQESSSSTEGPLSTESSTAV 390

Query:   503 ASGSTDAKDGQPRNAASATGTTDLP-STPAKDEKXXXXXXXXXXXXXCVGDHAKT--GGL 559
                S+  +  Q      ++ +T+ P ST +  E                 + + +  G L
Sbjct:   391 TDQSSSTESSQDSTTQESSSSTEGPLSTESSTEATNESSSTESSQDSTTQESSSSTEGPL 450

Query:   560 SGKEDARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQR-----ETVSSKNATLHR 614
             S +    ++   S T +     +  S  S  G P ST +  +       T SS+++T   
Sbjct:   451 STESSTEATNESSSTESSQDSTTQESSSSTEG-PLSTESSTEATNESSSTESSQDSTTQE 509

Query:   615 NSASDR-PSQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSR 673
             +S+S   P     + E   +    E +      +  +   SP          ++S T S 
Sbjct:   510 SSSSSEGPLSTESSTEATNESSSTESSQDSTTQESSSSTESPLSTEPSTEANESSSTESS 569

Query:   674 ASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACP------DEGSG 727
               S       +  D    E +     + SS  +S+   +  +  +   P       EGS 
Sbjct:   570 QDSTTQESSSSTEDPLSTESSTEATNESSSTESSQDSTTQESSSSTEGPLSTESSTEGSN 629

Query:   728 SPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVKYSE 787
               +     Q S T  +       L   S  P  E  +    ESS  S      S  +   
Sbjct:   630 ESSSTESSQDSTTQKSSSSTESPL---STEPSTEANESSSTESSQDSTTQESSSSTEGPL 686

Query:   788 ANVSAPAGDDIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDH 847
             +   +   ++      +  +  + S S    P+ +   T  +E    ++     +     
Sbjct:   687 STEPSTEANESSSTESSQDSTTQESSSSSEGPLSTESSTEANESSSTESSQDSTTQESSS 746

Query:   848 STDST-DDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSP-VDLQQSGDPCQENTEN 905
             ST+S    E   +  + +    + DS   + +    G +ST P  +  +S      +TE+
Sbjct:   747 STESPLSTEPSTEANESSSTESSQDSTTQESSSSTEGPLSTEPSTEANESS-----STES 801

Query:   906 SKEIIVAEETPDGAGRNPEED--KAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVET 963
             S++    E +    G    E   +A      + + D   + S   STED +S  +    T
Sbjct:   802 SQDSTTQESSSSSEGPLSTESSTEANESSSTESSQDSTTQESSS-STEDPLSTES---ST 857

Query:   964 EAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVKGKDGELLHTSGSG 1023
             EA   S+S +S + D   ++  S     G    +  +  +  S S +         S S 
Sbjct:   858 EATYESSSTESSQ-DSTTQESSSS--TEGPLSTESSTEGSNESSSTESSQDSTTQESSSS 914

Query:  1024 EDMPLXXXXXXXXXXXXXXXSKSHVNQTEEQNSEWKSNAPMIREDRVVPHLGSAENEEKG 1083
              + PL               S S  + T+E +S   +  P+  E     +  S+    + 
Sbjct:   915 TESPLSTEPSTEANESSSTES-SQDSTTQESSSS--TEGPLSTESSTEANESSSTESSQD 971

Query:  1084 NGKVDHRENLEG 1095
             +   +   + EG
Sbjct:   972 STTQESSSSTEG 983

 Score = 163 (62.4 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 139/735 (18%), Positives = 276/735 (37%)

Query:   418 IQKKARSLVDTWKKRVEAEMDARPRLPEVPHSGNRQTGASTEVAIKSLVTQPASSKTGAV 477
             +  ++ + V       E+  D+  +       G   T +STE   +S  T+ +   T   
Sbjct:   382 LSTESSTAVTDQSSSTESSQDSTTQESSSSTEGPLSTESSTEATNESSSTESSQDSTTQ- 440

Query:   478 KLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASAT-GTTDLPS-TPAKDEK 535
                +  + T+   S+  S + A   +S +  ++D   + ++S+T G     S T A +E 
Sbjct:   441 ---ESSSSTEGPLSTESSTE-ATNESSSTESSQDSTTQESSSSTEGPLSTESSTEATNES 496

Query:   536 XXXXXXXXXXXXXCVGDHAKTGGLSGK---EDARSSATVSMTLNKISVGSSRSRKSVNGY 592
                               +  G LS +   E    S++   + +  +  SS S +S    
Sbjct:   497 SSTESSQDSTTQE--SSSSSEGPLSTESSTEATNESSSTESSQDSTTQESSSSTESPLST 554

Query:   593 PSSTPAGVQRETVSSKNATLHRNSAS-DRPSQPSLTCEKALDVPVVEGANPKIIVKIPNR 651
               ST A     T SS+++T   +S+S + P     + E   +    E +      +  + 
Sbjct:   555 EPSTEANESSSTESSQDSTTQESSSSTEDPLSTESSTEATNESSSTESSQDSTTQESSSS 614

Query:   652 GRSP--AQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEP 709
                P   ++S+ GS E +S  +S+ S+    +K +    +      +  A+ SS+  S  
Sbjct:   615 TEGPLSTESSTEGSNESSSTESSQDSTT---QKSSSSTESPLSTEPSTEANESSSTESSQ 671

Query:   710 WQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHE 769
               S   + +++     S  P+   +E  S          E    +  P   E    + +E
Sbjct:   672 -DSTTQESSSSTEGPLSTEPSTEANESSSTESSQDSTTQESSSSSEGPLSTE-SSTEANE 729

Query:   770 SSF--SSMNALIESCVKYSEANVSA-PA--GDDIGMNLLASVAAGEMSKSDVVSPVGSPP 824
             SS   SS ++  +     +E+ +S  P+   ++      +  +  + S S    P+ + P
Sbjct:   730 SSSTESSQDSTTQESSSSTESPLSTEPSTEANESSSTESSQDSTTQESSSSTEGPLSTEP 789

Query:   825 RTPIHEPLCDDN--DSRVK----SFPGDHSTDSTDDEHEKQGID--RNLWAKNSDSNQDK 876
              T  +E    ++  DS  +    S  G  ST+S+ + +E    +  ++   + S S+ + 
Sbjct:   790 STEANESSSTESSQDSTTQESSSSSEGPLSTESSTEANESSSTESSQDSTTQESSSSTED 849

Query:   877 PAGGLTGHIST-SPVDLQQSGDPC-QENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDA 934
             P    +   +T      + S D   QE++ +++  +  E + +G+  +   + +      
Sbjct:   850 PLSTESSTEATYESSSTESSQDSTTQESSSSTEGPLSTESSTEGSNESSSTESSQDSTTQ 909

Query:   935 DGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQ----SLEFDGE-NKKGVSEGL 989
             + +   +  +S   STE   S ST   +    + S+S+     S E   E N+   +E  
Sbjct:   910 ESSSSTESPLSTEPSTEANESSSTESSQDSTTQESSSSTEGPLSTESSTEANESSSTESS 969

Query:   990 NSGVKREQKPS---PITTHSESVKGKDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKS 1046
                  +E   S   P++T S S +G + E   T  S +D                  S +
Sbjct:   970 QDSTTQESSSSTEGPLSTES-STEGSN-ESSSTESS-QDSTTQESSSSTESPLSTEPS-T 1025

Query:  1047 HVNQTEEQNSEWKSNAPMIREDRVVPHLGSAENEEKGNGKVDHRENLEGKEVKEELCA-- 1104
               N++    S   S           P   S E+  + + +    E+ +    +E   +  
Sbjct:  1026 EANESSSTESSQDSTTQESSSSTEGPL--STESSTEASNESSSTESSQDSTTQESSSSTE 1083

Query:  1105 GPALPEVSTALRAQE 1119
             GP   E ST +  QE
Sbjct:  1084 GPLSTESSTEV-TQE 1097

 Score = 162 (62.1 bits), Expect = 4.7e-05, Sum P(3) = 4.7e-05
 Identities = 105/577 (18%), Positives = 223/577 (38%)

Query:   449 SGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSS---PVSVKSAPLPASG 505
             S  +++ +STE  + +  +  A+ ++ + +  Q D+ T+ + SS   P+S +S+    + 
Sbjct:   539 STTQESSSSTESPLSTEPSTEAN-ESSSTESSQ-DSTTQESSSSTEDPLSTESSTEATNE 596

Query:   506 STDAKDGQPRNAASATGTTDLP-----STPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLS 560
             S+  +  Q      ++ +T+ P     ST   +E                          
Sbjct:   597 SSSTESSQDSTTQESSSSTEGPLSTESSTEGSNESSSTESSQDSTTQKSSSSTESPLSTE 656

Query:   561 GKEDARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDR 620
                +A  S++   + +  +  SS S +       ST A     T SS+++T   +S+S  
Sbjct:   657 PSTEANESSSTESSQDSTTQESSSSTEGPLSTEPSTEANESSSTESSQDSTTQESSSSSE 716

Query:   621 PSQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLP 680
                 + +  +A +    E +      +  +   SP          ++S T S   S    
Sbjct:   717 GPLSTESSTEANESSSTESSQDSTTQESSSSTESPLSTEPSTEANESSSTESSQDSTT-- 774

Query:   681 EKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKT 740
             ++ +           +  A+ SS+  S   Q +  +++++  +    + +     + S T
Sbjct:   775 QESSSSTEGPLSTEPSTEANESSSTESS--QDSTTQESSSSSEGPLSTESSTEANESSST 832

Query:   741 GDNCRKVVEDLEDNSLPP-GYEFKDVKLHESSF--SSMNALIESCVKYSEANVSAPAGDD 797
               +     ++   ++  P   E      +ESS   SS ++  +     +E  +S  +  +
Sbjct:   833 ESSQDSTTQESSSSTEDPLSTESSTEATYESSSTESSQDSTTQESSSSTEGPLSTESSTE 892

Query:   798 IGMNLLASVAAGE-----MSKSDVVSPVGSPPRTPIHEPLCDDN--DSRVK----SFPGD 846
              G N  +S  + +      S S   SP+ + P T  +E    ++  DS  +    S  G 
Sbjct:   893 -GSNESSSTESSQDSTTQESSSSTESPLSTEPSTEANESSSTESSQDSTTQESSSSTEGP 951

Query:   847 HSTDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQENTENS 906
              ST+S+ + +E    + +  +   +S+     G L+   ST   +   S +  Q++T  +
Sbjct:   952 LSTESSTEANESSSTESSQDSTTQESSSSTE-GPLSTESSTEGSNESSSTESSQDST--T 1008

Query:   907 KEIIVAEETPDGAGRNPEEDKAGFRVDADGAP--DGKQRISGPLSTEDKVSESTRGVETE 964
             +E   + E+P     + E +++     +  +   +      GPLSTE     S     TE
Sbjct:  1009 QESSSSTESPLSTEPSTEANESSSTESSQDSTTQESSSSTEGPLSTESSTEASNESSSTE 1068

Query:   965 AVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSP 1001
             + + S + +S      + +G      S  +  Q+PSP
Sbjct:  1069 SSQDSTTQES----SSSTEGPLS-TESSTEVTQEPSP 1100

 Score = 150 (57.9 bits), Expect = 0.00095, Sum P(3) = 0.00095
 Identities = 117/652 (17%), Positives = 234/652 (35%)

Query:   456 ASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPR 515
             +S+  ++ +  TQ +SS T      Q ++ T +   S  S +S P   +  + +    P 
Sbjct:   326 SSSSESLPNSSTQDSSSSTETS--FQTESTTDATDESS-STESQPDSTTQESSSSTEGPL 382

Query:   516 NAASATGTTDLPSTP--AKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSSATV-- 571
             +  S+T  TD  S+   ++D                      T   S  E ++ S T   
Sbjct:   383 STESSTAVTDQSSSTESSQDSTTQESSSSTEGPLSTESSTEATNESSSTESSQDSTTQES 442

Query:   572 -SMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLTCEK 630
              S T   +S  SS    + +    S+     +E+ SS    L   S+++  ++ S T E 
Sbjct:   443 SSSTEGPLSTESSTEATNESSSTESSQDSTTQESSSSTEGPLSTESSTEATNESSST-ES 501

Query:   631 ALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNF 690
             + D    E ++        + G    ++S+  + E +S  +S+ S+        +   + 
Sbjct:   502 SQDSTTQESSSS-------SEGPLSTESSTEATNESSSTESSQDSTTQESSSSTESPLST 554

Query:   691 KEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVED 750
             +   +A  +  SS  +S+   S   + +++  D  S   +     + S T  +     ++
Sbjct:   555 EPSTEANES--SSTESSQ--DSTTQESSSSTEDPLSTESSTEATNESSSTESSQDSTTQE 610

Query:   751 LEDNSLPP-GYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLLASVAAG 809
                ++  P   E      +ESS  S  +  +S  + S ++  +P   +       S ++ 
Sbjct:   611 SSSSTEGPLSTESSTEGSNESS--STESSQDSTTQKSSSSTESPLSTEPSTEANES-SST 667

Query:   810 EMSKSDVVSPVGSPPRTPIH-EPLCDDNDSRVKSFPGDHSTDSTDDEHEKQGIDRNLWAK 868
             E S+        S    P+  EP  + N+S       D +T  +    E      +    
Sbjct:   668 ESSQDSTTQESSSSTEGPLSTEPSTEANESSSTESSQDSTTQESSSSSEGPLSTESSTEA 727

Query:   869 NSDSNQDKPAGGLTGHISTS---PVDLQQSGDPCQEN-TENSKEIIVAEETPDGAGRNPE 924
             N  S+ +      T   S+S   P+  + S +  + + TE+S++    E +    G  P 
Sbjct:   728 NESSSTESSQDSTTQESSSSTESPLSTEPSTEANESSSTESSQDSTTQESSSSTEG--PL 785

Query:   925 EDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRG-VETEAVEGSASNQSLEFDGENKK 983
               +     +   + +  Q      ST  + S S+ G + TE+   +  + S E   ++  
Sbjct:   786 STEPSTEANESSSTESSQD-----STTQESSSSSEGPLSTESSTEANESSSTESSQDSTT 840

Query:   984 GVSEGLNSGVKREQKPSPITTHSESVKGKDGELLHTSGSGEDMPLXXXXXXXXXXXXXXX 1043
               S          +  +  T  S S +         S S  + PL               
Sbjct:   841 QESSSSTEDPLSTESSTEATYESSSTESSQDSTTQESSSSTEGPLSTESSTEGSNESSST 900

Query:  1044 SKSHVNQTEEQNSEWKSNAPMIREDRVVPHLGSAENEEKGNGKVDHRENLEG 1095
               S  + T+E +S  +S  P+  E     +  S+    + +   +   + EG
Sbjct:   901 ESSQDSTTQESSSSTES--PLSTEPSTEANESSSTESSQDSTTQESSSSTEG 950

 Score = 145 (56.1 bits), Expect = 0.00094, Sum P(2) = 0.00094
 Identities = 120/649 (18%), Positives = 223/649 (34%)

Query:   473 KTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLP-STPA 531
             K   +      +PT+ A  S   V S  +P   S+ A+  Q      +  +TD+  ST A
Sbjct:   156 KKDIISSSSDSSPTEDATYSSTQVSSTQVPEDASS-AESIQESTTQGSRSSTDISLSTEA 214

Query:   532 K-DEKXXXXXXXXXXXXXCVGDHAKTGG-----LSGKEDARS---SATVSMTLNKISVGS 582
               D+                   + T G     +S      S   S  +    + I   S
Sbjct:   215 SLDDIILSSESIVPTESSTTIISSSTEGSWESHISTDSSIGSKVESLLIEALYSLIQESS 274

Query:   583 SRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLTCEKALDVPVVEGANP 642
             S S   V+  PS+        T S  ++T   +S+S+ P    L+ E A +    E ++ 
Sbjct:   275 SSSESPVSNEPSTGATDDSSSTESLPDSTQESSSSSESPVSFELSTE-ATN----ESSSS 329

Query:   643 KIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKE---KNDALRA 699
             + +     +  S +  +S  +   T  T+  +S+   P+   Q   +  E     ++  A
Sbjct:   330 ESLPNSSTQDSSSSTETSFQTESTTDATDESSSTESQPDSTTQESSSSTEGPLSTESSTA 389

Query:   700 DISSNMNSEPWQSNVNKDAAACPDE--GSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLP 757
                 + ++E  Q +  +++++  +    + S     +E  S          E       P
Sbjct:   390 VTDQSSSTESSQDSTTQESSSSTEGPLSTESSTEATNESSSTESSQDSTTQESSSSTEGP 449

Query:   758 PGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSDVV 817
                E      +ESS  S  +  +S  + S ++   P   +         ++ E S+    
Sbjct:   450 LSTESSTEATNESS--STESSQDSTTQESSSSTEGPLSTESSTEATNESSSTESSQDSTT 507

Query:   818 SPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDKP 877
                 S    P+        ++  +S   + S DST  E             ++++N+   
Sbjct:   508 QESSSSSEGPLSTE--SSTEATNESSSTESSQDSTTQESSSSTESPLSTEPSTEANESSS 565

Query:   878 AGGLTGHISTSPVDLQQSGDPC--QENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDAD 935
                 +   ST+      + DP   + +TE + E    E + D   +       G  +  +
Sbjct:   566 TE--SSQDSTTQESSSSTEDPLSTESSTEATNESSSTESSQDSTTQESSSSTEG-PLSTE 622

Query:   936 GAPDGKQRISGPLSTED----KVSESTRGV-----ETEAVEGSASNQSLEFDGENKKGVS 986
              + +G    S   S++D    K S ST         TEA E S++  S +   +     +
Sbjct:   623 SSTEGSNESSSTESSQDSTTQKSSSSTESPLSTEPSTEANESSSTESSQDSTTQESSSST 682

Query:   987 EGLNSGVKREQKPSPITTHSESVKGKDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKS 1046
             EG  S      +PS     S S +         S S  + PL               S S
Sbjct:   683 EGPLS-----TEPSTEANESSSTESSQDSTTQESSSSSEGPLSTESSTEANESSSTES-S 736

Query:  1047 HVNQTEEQNSEWKSNAPMIREDRVVPHLGSAENEEKGNGKVDHRENLEG 1095
               + T+E +S  +S  P+  E     +  S+    + +   +   + EG
Sbjct:   737 QDSTTQESSSSTES--PLSTEPSTEANESSSTESSQDSTTQESSSSTEG 783

 Score = 45 (20.9 bits), Expect = 3.7e-05, Sum P(3) = 3.7e-05
 Identities = 15/64 (23%), Positives = 28/64 (43%)

Query:   225 TSTSQLKPGSDGAQNSASFPSQVKGKKRER-GDQSSEPVKRERSSKMEDGNSGHSRTETN 283
             +S+++ +P S   ++S+S    +  +      DQSS     + S+  E  +S      T 
Sbjct:   360 SSSTESQPDSTTQESSSSTEGPLSTESSTAVTDQSSSTESSQDSTTQESSSSTEGPLSTE 419

Query:   284 LKTE 287
               TE
Sbjct:   420 SSTE 423

 Score = 44 (20.5 bits), Expect = 1.7e-07, Sum P(3) = 1.7e-07
 Identities = 14/54 (25%), Positives = 24/54 (44%)

Query:   234 SDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSKMEDGNSGHSRTETNLKTE 287
             S GA + +S    +    +E    S  PV  E S++  + +S  S +  N  T+
Sbjct:   286 STGATDDSSSTESLPDSTQESSSSSESPVSFELSTEATNESSS-SESLPNSSTQ 338

 Score = 42 (19.8 bits), Expect = 2.6e-07, Sum P(3) = 2.6e-07
 Identities = 17/64 (26%), Positives = 30/64 (46%)

Query:   225 TSTSQLKPGSDGAQNSASFPSQVKGKKR-ERGDQSSEPVKRERSSKMEDGNSGHSRTETN 283
             +S+S+  P S    +S+S  +  + +   +  D+SS    +  S+  E  +S    TE  
Sbjct:   326 SSSSESLPNSSTQDSSSSTETSFQTESTTDATDESSSTESQPDSTTQESSSS----TEGP 381

Query:   284 LKTE 287
             L TE
Sbjct:   382 LSTE 385

 Score = 38 (18.4 bits), Expect = 1.7e-07, Sum P(3) = 1.7e-07
 Identities = 10/29 (34%), Positives = 13/29 (44%)

Query:  1535 AVNSQLMGPAGAVPSHFP-RPYVVSLPDG 1562
             A  S L  P G  P H   R    ++P+G
Sbjct:  1292 AETSTLKSPTGTTPGHQEDRTDCSNMPNG 1320


>UNIPROTKB|Q9NZW4 [details] [associations]
            symbol:DSPP "Dentin sialophosphoprotein" species:9606 "Homo
            sapiens" [GO:0031214 "biomineral tissue development" evidence=IEA]
            [GO:0071460 "cellular response to cell-matrix adhesion"
            evidence=IEA] [GO:0005578 "proteinaceous extracellular matrix"
            evidence=IEA] [GO:0001501 "skeletal system development"
            evidence=TAS] [GO:0005201 "extracellular matrix structural
            constituent" evidence=TAS] [GO:0005509 "calcium ion binding"
            evidence=TAS] [GO:0005518 "collagen binding" evidence=TAS]
            [GO:0007275 "multicellular organismal development" evidence=TAS]
            [GO:0001503 "ossification" evidence=TAS] GO:GO:0005578
            GO:GO:0005509 GO:GO:0001501 GO:GO:0005518 eggNOG:NOG12793
            GO:GO:0031214 GO:GO:0001503 GO:GO:0005201 EMBL:AF163151
            EMBL:AC093895 EMBL:AF094508 IPI:IPI00872967 RefSeq:NP_055023.2
            UniGene:Hs.678914 ProteinModelPortal:Q9NZW4 STRING:Q9NZW4
            PhosphoSite:Q9NZW4 DMDM:215273974 PaxDb:Q9NZW4 PRIDE:Q9NZW4
            Ensembl:ENST00000282478 Ensembl:ENST00000399271 GeneID:1834
            KEGG:hsa:1834 UCSC:uc003hqu.3 CTD:1834 GeneCards:GC04P088529
            HGNC:HGNC:3054 HPA:HPA036230 MIM:125420 MIM:125485 MIM:125490
            MIM:125500 MIM:605594 neXtProt:NX_Q9NZW4 Orphanet:1653
            Orphanet:166260 Orphanet:166265 PharmGKB:PA27507 HOVERGEN:HBG098252
            OMA:ERESKVQ OrthoDB:EOG41RPVG GenomeRNAi:1834 NextBio:7491
            PMAP-CutDB:A8MUI0 ArrayExpress:Q9NZW4 Bgee:Q9NZW4 CleanEx:HS_DSPP
            Genevestigator:Q9NZW4 GermOnline:ENSG00000152591 GO:GO:0071460
            Uniprot:Q9NZW4
        Length = 1301

 Score = 155 (59.6 bits), Expect = 1.8e-06, Sum P(2) = 1.8e-06
 Identities = 114/621 (18%), Positives = 225/621 (36%)

Query:   449 SGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAP-TKSAFSSPVS-VKSAPLPASGS 506
             S +  + +S++ +  S  +  + S   +      D+  + S+ SS  S   S+   +S S
Sbjct:   672 SDSSDSSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNS 731

Query:   507 TDAKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDAR 566
             +D+ D    N++ ++ ++D  ++    +                 D + +   S   D+ 
Sbjct:   732 SDSSDSS--NSSDSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSS 789

Query:   567 SSATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSL 626
             +S+  S + N  S  S+ S  S +   S + +    ++ +S +++   NS+    S  S 
Sbjct:   790 NSSDSSDSSNS-SDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDSSDSSNSSDSSDSSDSS 848

Query:   627 TCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQF 686
                 +      + +N        +   S + NSS  S  D+S +N  ++S    +  N  
Sbjct:   849 DGSDSDSSNRSDSSNSS-----DSSDSSDSSNSSDSS--DSSDSNESSNSSDSSDSSNSS 901

Query:   687 DRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRK 746
             D +  + +++  +  SSN +     SN + ++ +     S   +   D   S    N   
Sbjct:   902 DSDSSDSSNSSDSSDSSNSSDSSESSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSSD 961

Query:   747 VVE--DLED-NSLPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLL 803
                  D  D NS             +SS SS ++        S+++ S+ + D    N  
Sbjct:   962 SSNSSDSSDSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDS--SNSS 1019

Query:   804 ASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHS--TDSTDDEHEKQGI 861
              S  + + S S   S       +       D +DS   S   D S  +DS+D        
Sbjct:  1020 DSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSS 1079

Query:   862 DRNLWAKNSDS-NQDKPAGGLTGHISTSPVDLQQSGDPCQE-NTENSKEIIVAEETPDGA 919
             D +  + +SDS N    +       S+   D   S D     N+ +S +   + ++ D +
Sbjct:  1080 DSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSS 1139

Query:   920 GRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDG 979
               +   D +     +D +       S   S     S+S+   ++     S S+ S   D 
Sbjct:  1140 NSSDSSDSSESSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDS-SDSSDSSDS 1198

Query:   980 ENKKGVSEGLNSGVKREQKPSPITTHSESVKGKDGELLHTSGSGEDMPLXXXXXXXXXXX 1039
              +    S+  +S    +   S  +  S+S    D     +S S +               
Sbjct:  1199 SDSSDSSDSSDSSDSSDSSDS--SDSSDSSDSSDSN--ESSDSSDSSDSSDSSNSSDSSD 1254

Query:  1040 XXXXSKSHVNQTEEQNSEWKS 1060
                 S S  +  +E +S+ KS
Sbjct:  1255 SSDSSDSTSDSNDESDSQSKS 1275

 Score = 152 (58.6 bits), Expect = 1.5e-05, Sum P(3) = 1.5e-05
 Identities = 103/591 (17%), Positives = 212/591 (35%)

Query:   483 DAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXX 542
             D+   S  S      S+   +S  +D+ D    +++ +  +    S+ + D         
Sbjct:   556 DSSNSSDSSDSSDSDSSDSNSSSDSDSSDSDSSDSSDSDSSDSSNSSDSSDSSDSSDSSD 615

Query:   543 XXXXXXCVGDHAKTGGLSGKEDARSSATVSMTL----NKISVGSSRSRKSVNGYPSSTPA 598
                      D +K+   S   D++S ++ S +     N  S  SS S  S +   SS  +
Sbjct:   616 SSDSSDSKSDSSKSESDSSDSDSKSDSSDSNSSDSSDNSDSSDSSNSSNSSDSSDSSDSS 675

Query:   599 GVQRETVSSKNATLHRNSASDRPSQPSLTCEKA----LDVPVVEGANPKIIVKIPNRGRS 654
                  + SS ++    +S S   S+ S + + +     D      +N        +   S
Sbjct:   676 DSSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNSSDSS 735

Query:   655 PAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNV 714
              + NSS  S  D+S +++ + S    +  N  D +  + +D+  +  SSN +     SN 
Sbjct:   736 DSSNSSDSS--DSSDSSNSSDSSDSSDSSNSSDSS--DSSDSSDSSDSSNSSDSNDSSNS 791

Query:   715 NKDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSS 774
             +  + +     S + +   D   S   D+      D   NS             +SS SS
Sbjct:   792 SDSSDSSNSSDSSNSSDSSDSSDSSDSDSSNS--SD-SSNSSDSSDSSNSSDSSDSSDSS 848

Query:   775 MNALIESCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCD 834
               +  +S  +   +N S  +      N   S  + + ++S   S       +   +    
Sbjct:   849 DGSDSDSSNRSDSSNSSDSSDSSDSSNSSDSSDSSDSNESSNSSDSSDSSNSSDSDSSDS 908

Query:   835 DNDSRVKSFPGDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQ 894
              N S   S    +S+DS++  +     + +  + +SDS+    +   +   ++S  D   
Sbjct:   909 SNSS--DSSDSSNSSDSSESSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSS--DSSN 964

Query:   895 SGDPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKV 954
             S D    N+ +S +   + ++ D +  +   D +     +D +       S   S     
Sbjct:   965 SSDSSDSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSNSSDSSNS 1024

Query:   955 SESTRGVET-EAVEGSASNQSLEF-DGENKKGVSEGLNSGVKREQKPSPITTHS-ESVKG 1011
             S+S+   ++ ++ + S S+ S +  D  +    S+  +S    +   S  ++ S +S + 
Sbjct:  1025 SDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSES 1084

Query:  1012 KDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKSHVNQTEEQNSEWKSNA 1062
              D      S    D                  S    N ++  +S   S++
Sbjct:  1085 SDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDS 1135

 Score = 150 (57.9 bits), Expect = 6.0e-06, Sum P(2) = 6.0e-06
 Identities = 107/551 (19%), Positives = 209/551 (37%)

Query:   449 SGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTD 508
             S N    + +  +  S  +  +S  + +           S+ SS  S  S    +S S+D
Sbjct:   749 SSNSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSD 808

Query:   509 AKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSS 568
             + D    + + ++ ++D  S  +                   G  + +   S   ++  S
Sbjct:   809 SSDSSDSSDSDSSNSSD-SSNSSDSSDSSNSSDSSDSSDSSDGSDSDSSNRSDSSNSSDS 867

Query:   569 ATVSMTLNKI-SVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLT 627
             +  S + N   S  SS S +S N   SS  +    ++ SS ++    +S S   S  S +
Sbjct:   868 SDSSDSSNSSDSSDSSDSNESSNSSDSSDSSN-SSDSDSSDSSNSSDSSDSSNSSDSSES 926

Query:   628 CEKALDVPVVEGANPKIIVKIPNRGRSP-AQNSSGGS-VEDTSVTNSRASSPVLPEKQNQ 685
                + +    + +N        +   S  + NSS  S   D+S +NS  SS    +  N 
Sbjct:   927 SNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSNSSDSS----DSSNS 982

Query:   686 FDRN-FKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNC 744
              D +   + +D+  +  SSN +S+   S+ + D++   D  + S +    +    +  + 
Sbjct:   983 SDSSDSSDSSDSSDSSDSSN-SSDSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSSDSSD 1041

Query:   745 RKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLLA 804
                  D  D+S        D    +SS SS ++        S+++ S+ + D    N   
Sbjct:  1042 SSDSSDSSDSS--DSSNSSDSS--DSSDSSDSSDSSDSSDSSDSSESSDSSDS--SNSSD 1095

Query:   805 SVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGD--HSTDSTDDEHEKQGID 862
             S  + + S S   S       +       D +DS   S   D  +S+DS+D        D
Sbjct:  1096 SSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSSDSSESSDSSD 1155

Query:   863 RNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQEN-TENSKEIIVAEETPDGAGR 921
              +  + +SDS+    +   +   ++S  D   S D    + + +S +   + ++ D +  
Sbjct:  1156 SSDSSDSSDSSDSSDSSDSSDSSNSS--DSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDS 1213

Query:   922 NPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDGEN 981
             +   D +     +D +   +   S   S     S S+   ++     S S+ + E D ++
Sbjct:  1214 SDSSDSSDSSDSSDSSDSNESSDSSDSSDSSDSSNSSDSSDSSDSSDSTSDSNDESDSQS 1273

Query:   982 KKGVSEGLNSG 992
             K G   G N+G
Sbjct:  1274 KSG--NGNNNG 1282

 Score = 146 (56.5 bits), Expect = 6.2e-05, Sum P(3) = 6.2e-05
 Identities = 112/624 (17%), Positives = 222/624 (35%)

Query:   449 SGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTD 508
             S +  +  S++ +  S  +  + SK+ + K     + + S   S  S  S     S S+D
Sbjct:   599 SNSSDSSDSSDSSDSSDSSDSSDSKSDSSKSESDSSDSDSKSDSSDSNSSDSSDNSDSSD 658

Query:   509 AKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSS 568
             + +    N++ ++ ++D   + +  +                 + + +   S  + + SS
Sbjct:   659 SSNSS--NSSDSSDSSDSSDSSSSSDSSNSSDSSDSSDSSNSSESSDSSDSSDSDSSDSS 716

Query:   569 -ATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLT 627
              ++ S + +  S  SS S  S N   SS  +     + SS ++    +S S   S  S +
Sbjct:   717 DSSNSNSSDSDSSNSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSNSSDSSDSSDSSDS 776

Query:   628 CEKALDVPVVEGANPKIIVKIPNRG----RSPAQNSSGGSVEDTSVTNSRASSPVLPEKQ 683
              + +      + +N        N       S + +SS  S  D+S ++  ++S    +  
Sbjct:   777 SDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDSSDSS 836

Query:   684 NQFDRN-FKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGD 742
             N  D +   + +D   +D S+  +S     + +   ++   + S S         S + D
Sbjct:   837 NSSDSSDSSDSSDGSDSDSSNRSDSSNSSDSSDSSDSSNSSDSSDSSDSNESSNSSDSSD 896

Query:   743 NCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDI-GMN 801
             +      D  D+S     +  D      S  S N+   S    S+++ S+ + D     N
Sbjct:   897 SSNSSDSDSSDSS--NSSDSSDSSNSSDSSESSNSSDNS--NSSDSSNSSDSSDSSDSSN 952

Query:   802 LLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEKQGI 861
                S  + + S S   S   S   +       D +DS   S   D S+DS++        
Sbjct:   953 SSDSSNSSDSSNSSDSSDSNSSDSSDSSNS-SDSSDSSDSSDSSD-SSDSSNSSDSSDSS 1010

Query:   862 DRNLWAKNSDS-NQDKPAGGLTGHISTSPVDLQQSGDPCQEN-TENSKEIIVAEETPDGA 919
             D +  + +SDS N    +       S+   D   S D    + + NS +   + ++ D +
Sbjct:  1011 DSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSS 1070

Query:   920 GRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDG 979
               +   D +     +D +       S   S     S+S+   ++     S SN S   D 
Sbjct:  1071 DSSDSSDSSDSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDS-SNSSDSSDS 1129

Query:   980 ENKKGVSEGLNSGVKREQKPSPITTHS-ESVKGKDGELLHTSGSGEDMPLXXXXXXXXXX 1038
              +    S+  NS    +   S  ++ S +S    D      S    D             
Sbjct:  1130 SDSSDSSDSSNSSDSSDSSESSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDS 1189

Query:  1039 XXXXXSKSHVNQTEEQNSEWKSNA 1062
                  S    + ++  +S   S++
Sbjct:  1190 SDSSDSSDSSDSSDSSDSSDSSDS 1213

 Score = 145 (56.1 bits), Expect = 2.0e-05, Sum P(2) = 2.0e-05
 Identities = 104/543 (19%), Positives = 195/543 (35%)

Query:   488 SAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXXXXXXX 547
             S+ SS  S  S    +S S+D+ D    + +S +  +   S  +  +             
Sbjct:   773 SSDSSDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSDSSDSSDSSDSDSSNSSDSSNSSDS 832

Query:   548 XCVGDHAKTGGLSGKEDARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSS 607
                 + + +   S   D   S + + + +  S  SS S  S N   SS  +     + SS
Sbjct:   833 SDSSNSSDSSDSSDSSDGSDSDSSNRSDSSNSSDSSDSSDSSNSSDSSDSSDSNESSNSS 892

Query:   608 KNATLHRNSASDRP--SQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGS-V 664
              ++    +S SD    S  S + + +      E +N        N   S + NSS  S  
Sbjct:   893 DSSDSSNSSDSDSSDSSNSSDSSDSSNSSDSSESSNSS-----DNSNSSDSSNSSDSSDS 947

Query:   665 EDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDE 724
              D+S ++  ++S       +  D N  + +D+  +  SS+ +S+   S+ + D++   D 
Sbjct:   948 SDSSNSSDSSNSSDSSNSSDSSDSNSSDSSDSSNSSDSSD-SSDSSDSSDSSDSSNSSDS 1006

Query:   725 GSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVK 784
                S +   D   S    N        + +      +  D      S  S N+   S   
Sbjct:  1007 SDSSDS--SDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSS--D 1062

Query:   785 YSEANVSAPAGDDIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFP 844
              S+++ S+ + D    +   S  + + S S   S       +       D +DS   S  
Sbjct:  1063 SSDSSDSSDSSDSSDSS--DSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSS-D 1119

Query:   845 GDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQEN-T 903
               +S+DS+D        D +  + +SDS++   +   +    +S  D   S D    + +
Sbjct:  1120 SSNSSDSSDSSDSSDSSDSSNSSDSSDSSESSDSSDSSDSSDSS--DSSDSSDSSDSSDS 1177

Query:   904 ENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVET 963
              NS +   + ++ D +  +   D +    D+  + D     S    + D    S      
Sbjct:  1178 SNSSDSSDSSDSSDSSDSSDSSDSS----DSSDSSDSSDS-SDSSDSSDSSDSSDSSDSN 1232

Query:   964 EAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVKGKD-GELLHTSGS 1022
             E+ + S S+ S   D  N    S+  +S            + S+S  G + G    +   
Sbjct:  1233 ESSDSSDSSDSS--DSSNSSDSSDSSDSSDSTSDSNDESDSQSKSGNGNNNGSDSDSDSE 1290

Query:  1023 GED 1025
             G D
Sbjct:  1291 GSD 1293

 Score = 145 (56.1 bits), Expect = 8.1e-05, Sum P(2) = 8.1e-05
 Identities = 104/526 (19%), Positives = 184/526 (34%)

Query:   551 GDHAKTGGLS-GKEDARS-SATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSK 608
             GD   +   S G +DA S S   S +    S  S  S+ + NG  S        ++ S  
Sbjct:   462 GDDPNSSDESNGNDDANSESDNNSSSRGDASYNSDESKDNGNGSDSKGAEDDDSDSTSDT 521

Query:   609 N---ATLHRNSASDRPSQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVE 665
             N   +  + N+ +D   +      K+ D    + ++        +   S + +S+  S  
Sbjct:   522 NNSDSNGNGNNGNDDNDKSDSGKGKS-DSSDSDSSDSSNSSDSSDSSDSDSSDSNSSSDS 580

Query:   666 DTSVTNSRASSPVLPEKQNQFDRNFKEKNDAL-RADISSNMNSEPWQSNVNKDAAACPDE 724
             D+S ++S  SS    +  +    N  + +D+   +D S + +S   +S+ +K  +   D 
Sbjct:   581 DSSDSDSSDSS----DSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSKSDSSKSESDSSDS 636

Query:   725 GSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIES-CV 783
              S S +   D   S + DN          NS     +  D     SS  S N+   S   
Sbjct:   637 DSKSDS--SDSNSSDSSDNSDSSDSSNSSNSSDSS-DSSDSSDSSSSSDSSNSSDSSDSS 693

Query:   784 KYSEANVSAPAGDDIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEP-LCDDNDSRVKS 842
               S ++ S+ + D    +   S  +   + SD  S   S      +     D +DS   S
Sbjct:   694 DSSNSSESSDSSDSSDSDSSDSSDSSNSNSSDSDSSNSSDSSDSSNSSDSSDSSDSSNSS 753

Query:   843 FPGD-----HSTDSTDDEHEKQGIDR-NLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSG 896
                D     +S+DS+D        D  N    N  SN    +       S++  D   S 
Sbjct:   754 DSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSNDSSNSSDSSDSSNSSDSSNSSDSSDSS 813

Query:   897 DPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSE 956
             D    ++ NS +   + ++ D +  +   D +     +DG+       S   ++ D    
Sbjct:   814 DSSDSDSSNSSDSSNSSDSSDSSNSSDSSDSSD---SSDGSDSDSSNRSDSSNSSDSSDS 870

Query:   957 STRGVETEAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITTHSESVKGKDGEL 1016
             S     +++ + S SN+S      N    S+  NS        S  +  S+S    D   
Sbjct:   871 SDSSNSSDSSDSSDSNES-----SNSSDSSDSSNSSDSDSSDSSNSSDSSDSSNSSDSSE 925

Query:  1017 LHTSGSGEDMPLXXXXXXXXXXXXXXXSKSHVNQTEEQNSEWKSNA 1062
                S    +                  S    N ++  NS   S++
Sbjct:   926 SSNSSDNSNSSDSSNSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDS 971

 Score = 53 (23.7 bits), Expect = 1.8e-06, Sum P(2) = 1.8e-06
 Identities = 19/67 (28%), Positives = 30/67 (44%)

Query:   219 KPMNG--PTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSKMEDGNSG 276
             K M G  P S+ +   G+D A + +   S  +G      D+S +      S   ED +S 
Sbjct:   458 KSMQGDDPNSSDESN-GNDDANSESDNNSSSRGDASYNSDESKDNGNGSDSKGAEDDDSD 516

Query:   277 HSRTETN 283
              S ++TN
Sbjct:   517 -STSDTN 522

 Score = 50 (22.7 bits), Expect = 0.00012, Sum P(3) = 0.00012
 Identities = 17/72 (23%), Positives = 28/72 (38%)

Query:   213 SGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSKME- 271
             S   S    +   S S     SD + +S S  S       +  D S     +  SSK E 
Sbjct:   572 SDSNSSSDSDSSDSDSSDSSDSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSKSDSSKSES 631

Query:   272 DGNSGHSRTETN 283
             D +   S+++++
Sbjct:   632 DSSDSDSKSDSS 643

 Score = 47 (21.6 bits), Expect = 8.1e-05, Sum P(2) = 8.1e-05
 Identities = 30/142 (21%), Positives = 49/142 (34%)

Query:  1287 NVPDERVLEDLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELI 1346
             N  D     D +  S+  D+  +SD +N+ D S        S         D + + +  
Sbjct:  1002 NSSDSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSS 1061

Query:  1347 DIGNYSTSNGNKIDVPVQPGTSSGGLLNGEVNVRRDFDLNDGPVLDDCS-AEPSVFPQHP 1405
             D  + S S+ +  D      +S     +   N     D +D     D S +  S      
Sbjct:  1062 DSSDSSDSSDSS-DSSDSSDSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDS 1120

Query:  1406 RNVSQAPVSGLRLSSADTVNFS 1427
              N S +  S     S+D+ N S
Sbjct:  1121 SNSSDSSDSSDSSDSSDSSNSS 1142

 Score = 44 (20.5 bits), Expect = 0.00012, Sum P(3) = 0.00012
 Identities = 11/61 (18%), Positives = 29/61 (47%)

Query:   222 NGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSKMEDGNSGHSRTE 281
             N  +S       SD ++++ +  S  KG + +  D +S+    + +    +GN  + +++
Sbjct:   483 NNSSSRGDASYNSDESKDNGN-GSDSKGAEDDDSDSTSDTNNSDSNGNGNNGNDDNDKSD 541

Query:   282 T 282
             +
Sbjct:   542 S 542

 Score = 43 (20.2 bits), Expect = 1.5e-05, Sum P(3) = 1.5e-05
 Identities = 17/80 (21%), Positives = 28/80 (35%)

Query:  1296 DLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSN 1355
             D +  S   D+  +SD +++ D S        S         D + + +  D  + S S 
Sbjct:  1203 DSSDSSDSSDSSDSSDSSDSSDSSDSSDSNESSDSSDSSDSSDSSNSSDSSDSSDSSDST 1262

Query:  1356 GNKIDVPVQPGTSSGGLLNG 1375
              +  D       S  G  NG
Sbjct:  1263 SDSNDESDSQSKSGNGNNNG 1282

 Score = 42 (19.8 bits), Expect = 0.00026, Sum P(2) = 0.00026
 Identities = 24/102 (23%), Positives = 38/102 (37%)

Query:  1296 DLASRSSVQDTVTASDHTNNRDGSRCEVMGSK-SVRGSVGLDL-DLNRAEELIDIGNYST 1353
             D +  S+  D+  +SD +N+ D S      S  S   S   D  D + + +  D  N S 
Sbjct:   946 DSSDSSNSSDSSNSSDSSNSSDSSDSNSSDSSDSSNSSDSSDSSDSSDSSDSSDSSNSSD 1005

Query:  1354 SNGNKIDVPVQPGTSSGGLLNGEVNVRRDFDLNDGPVLDDCS 1395
             S+ +  D      +S     +   N     D +D     D S
Sbjct:  1006 SSDSS-DSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSS 1046

 Score = 41 (19.5 bits), Expect = 2.4e-05, Sum P(3) = 2.4e-05
 Identities = 15/71 (21%), Positives = 26/71 (36%)

Query:  1287 NVPDERVLEDLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELI 1346
             N  D     D +  S   D+  +SD +++ D S        S         D N + +  
Sbjct:  1179 NSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSNESSDSS 1238

Query:  1347 DIGNYSTSNGN 1357
             D  + S S+ +
Sbjct:  1239 DSSDSSDSSNS 1249

 Score = 41 (19.5 bits), Expect = 0.00023, Sum P(3) = 0.00023
 Identities = 13/51 (25%), Positives = 22/51 (43%)

Query:   233 GSDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSKMEDGNSGHSRTETN 283
             G++G  ++    S  KGK       SS+      SS   D +S  S + ++
Sbjct:   530 GNNGNDDNDKSDSG-KGKSDSSDSDSSDSSNSSDSSDSSDSDSSDSNSSSD 579

 Score = 40 (19.1 bits), Expect = 0.00042, Sum P(2) = 0.00042
 Identities = 13/60 (21%), Positives = 25/60 (41%)

Query:  1296 DLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSN 1355
             D ++ S   D+  +SD +N+ D S      + S         D + + +  D  + S S+
Sbjct:   999 DSSNSSDSSDSSDSSDSSNSSDSSNSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSNSS 1058

 Score = 38 (18.4 bits), Expect = 4.7e-05, Sum P(3) = 4.7e-05
 Identities = 13/60 (21%), Positives = 24/60 (40%)

Query:  1296 DLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSN 1355
             D +  S   D+  +SD +N+ D S        S   +     D + + +  D  + S S+
Sbjct:  1104 DSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSSNSSDSSDSSESSDSSDSSDSSDSS 1163

 Score = 38 (18.4 bits), Expect = 4.7e-05, Sum P(3) = 4.7e-05
 Identities = 14/66 (21%), Positives = 26/66 (39%)

Query:  1290 DERVLEDLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIG 1349
             D     D +  S+  D+  +SD +++ D S        S   +     D + + +  D  
Sbjct:  1080 DSSESSDSSDSSNSSDSSDSSDSSDSSDSSDSSDSSDSSDSSNSSDSSDSSDSSDSSDSS 1139

Query:  1350 NYSTSN 1355
             N S S+
Sbjct:  1140 NSSDSS 1145

 Score = 38 (18.4 bits), Expect = 0.00067, Sum P(2) = 0.00067
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query:  1296 DLASRSSVQDTVTASDHTNNRDGS 1319
             D +  S+  D   +SD +N+ D S
Sbjct:   922 DSSESSNSSDNSNSSDSSNSSDSS 945

 Score = 37 (18.1 bits), Expect = 5.9e-05, Sum P(3) = 5.9e-05
 Identities = 8/24 (33%), Positives = 13/24 (54%)

Query:  1296 DLASRSSVQDTVTASDHTNNRDGS 1319
             D +  S   D+  +SD +N+ D S
Sbjct:  1161 DSSDSSDSSDSSDSSDSSNSSDSS 1184


>WB|WBGene00019435 [details] [associations]
            symbol:K06A9.1 species:6239 "Caenorhabditis elegans"
            [GO:0016021 "integral to membrane" evidence=IEA]
            GeneTree:ENSGT00700000104174 EMBL:FO081100 UniGene:Cel.5442
            GeneID:180494 KEGG:cel:CELE_K06A9.1 CTD:180494 PIR:T34434
            RefSeq:NP_508292.1 ProteinModelPortal:Q8IFX6 PRIDE:Q8IFX6
            EnsemblMetazoa:K06A9.1b UCSC:K06A9.1b WormBase:K06A9.1b
            InParanoid:Q8IFX6 OMA:TITWATP NextBio:909622 ArrayExpress:Q8IFX6
            Uniprot:Q8IFX6
        Length = 2232

 Score = 136 (52.9 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 114/595 (19%), Positives = 200/595 (33%)

Query:   447 PHSGNRQTGASTEVA-IKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASG 505
             P S    TG++  ++   S++    S  T  V    G + +  A S+     S+ +P+S 
Sbjct:   537 PSSSPASTGSTITISGSSSIIVSTVSGST--VSGSTGTSQSTLASSTATPGSSSTVPSSS 594

Query:   506 STDAKDGQPR-NAASATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKED 564
             S       P  N  S T +     +P+                    + +     +G   
Sbjct:   595 SPQPSSQSPAPNTGSTTPSQTSSQSPSPSMNPSSSTPTGSSQSTITPEGSTASSPTGSTG 654

Query:   565 ARSSATVSMTLNK-ISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPS- 622
             +  S    +T    +  GSS   +S N  PS  P+ +   T      T   + +S + S 
Sbjct:   655 STFSVATEVTSQSTVPSGSSLGTQSTNSSPS--PSSLSPSTSGMSTLTSEPSPSSTQSSG 712

Query:   623 -QPSLTCEK---ALDVPVVEGANPKIIVKIPNRGR---SPAQNSSGGSVE-DTSVTNSRA 674
              Q +LT      +     +E +         + G    SP+Q+SS GS +  TS   S  
Sbjct:   713 AQSTLTTPSPNPSQSTSSLESSTSGATTSSGSAGTTMTSPSQSSSVGSSQGSTSPAASTT 772

Query:   675 SSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPD 734
             S  +  +   Q   +    + A+      ++++    S V + +       SGS   +  
Sbjct:   773 SGEMTSQGSTQTPGSSVSTSAAILTSTQQSVSTNSPGSTVTRPSTVSGSTSSGSTVTVGS 832

Query:   735 EQGSKTGDNCRKVVEDLEDNSLP-PGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAP 793
              + S +G +          +  P P        + +S + S +    S V+ S    + P
Sbjct:   833 TEASTSGSSVASSSPAPSTSQNPNPSTSSGSSMITQSPYPSQST---SPVESS----TTP 885

Query:   794 AGDDIGMNLLASVAAGEMSKSDVVSPVGS--PPRTPIHEPLCDDNDSRVKSFPGDHSTD- 850
             +    G  L  S +      + + S  GS  P  +   E +     ++     G   T  
Sbjct:   886 SPGSPGTTL-TSTSPSPSQSTTIGSTQGSTSPGISTTSEEMTSQGSTQTPGSTGSTVTQP 944

Query:   851 STDDEHEKQGIDRNLWAKN-------SDSNQDKPAGGLTGHISTSPVDLQQSGDPCQENT 903
             ST  +    G    + +         S S    P+      +ST      QS  P + +T
Sbjct:   945 STVSDSTSSGSTVTVGSTEGSSSPIPSTSQNTNPSTSSGSSMSTQTPQSSQSTSPVESST 1004

Query:   904 ENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVET 963
               +     +  T      +P    +     + G+           STE   S  +   + 
Sbjct:  1005 SGATSSSGSPGTTL-TSISPSPSPSSTIGSSQGSTSPVVSTISQGSTETPGSTGSTVTKP 1063

Query:   964 EAVEGSASNQSLEFDGENK-KGVSEGLNSGVKREQKPSPITTHSESVKGKDGELL 1017
               V GSAS+ S    G  +    S G ++     Q  SP T+ + S  G  G  L
Sbjct:  1064 STVSGSASSGSTATMGSTEASSTSGGSSTSPNPSQSTSPSTSGATSSPGSSGTTL 1118

 Score = 72 (30.4 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 61/262 (23%), Positives = 89/262 (33%)

Query:  1296 DLASRSSVQ-DTVTASDHTNNRDGS-RCEVMGSKSVRGSVGLDLD-LNRAEELIDIGNYS 1352
             D+ S+ S Q    T S  T    GS      G  + +GS       L+ +  +      S
Sbjct:  1148 DMTSQGSTQIPGSTGSTVTQPSTGSGSTSTSGEITSQGSTQTPRSSLSTSPAISTSTQQS 1207

Query:  1353 TSNGNKIDVPVQPGTSSGGLLNGEVNVRRDFDLNDGPVLDDCSAEPSVFPQHP-RNVSQA 1411
              S  +      QP T  G   +G           +G      S+  S+    P  + SQ+
Sbjct:  1208 VSTNSPGSTVTQPSTVRGSTSSGSTVTTGS---TEGSSTSGSSSATSLSSSSPVPSTSQS 1264

Query:  1412 PVSGLRLSSADTVNFSSWFPRGNTYSTIAVPS-VLPDRGEQPFP-IIAPCAPQRMLVPST 1469
             P      SS  T N     P  +T   ++  +  +   G    P  I     Q   V  +
Sbjct:  1265 PNPSTSGSSTPTPN-----PSQSTSPVVSTTTGEMTSHGSTQTPSTIGSTVTQPSTVSGS 1319

Query:  1470 SGSPFGPDVFRGXXXXXXXXXXXXXXXXQYPVFPFGTSFPLPSATFSGGTT--TYVDSSS 1527
             + S  G  V  G                   + P  TS P+PS TF+  T+  T  D SS
Sbjct:  1320 NSS--GSTVTIGSSEASTSGSSFKTSPSS--ISPVPTSSPIPSTTFASSTSGSTISDVSS 1375

Query:  1528 GGRFCFPAVNSQLMGPAGAVPS 1549
                     ++S L  P+  VPS
Sbjct:  1376 VSTTSLAPLSSSL--PS-TVPS 1394

 Score = 56 (24.8 bits), Expect = 0.00070, Sum P(3) = 0.00070
 Identities = 40/140 (28%), Positives = 53/140 (37%)

Query:  1409 SQAPVSGLRL----SSADTVNFSSWFPRGNTYSTIAVPSVLPDRGEQPFPIIAPCAPQR- 1463
             S+A  SG       SS   V  SS  P   T+++    S + D        +AP +    
Sbjct:  1331 SEASTSGSSFKTSPSSISPVPTSSPIP-STTFASSTSGSTISDVSSVSTTSLAPLSSSLP 1389

Query:  1464 MLVPSTSGSPFGPDVFRGXXXXXXXXXXXXXXXXQYPVFPFGT--SFPLPSATFSGGT-- 1519
               VPS++ S F      G                  P  P G+  S  L S+T SG T  
Sbjct:  1390 STVPSSTQS-FS-STSEGSSKASSSPVPSQTSST--PTNPTGSTESSTLLSSTISGSTQH 1445

Query:  1520 TTYVDSSSGGRFCFPAVNSQ 1539
             TT   +SSG     P+ NSQ
Sbjct:  1446 TTMSKASSGSTS--PSTNSQ 1463

 Score = 47 (21.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query:   211 VPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQV 247
             VP    SP P + P  +S   P S G+  + S  S +
Sbjct:   522 VPGSSTSPAPSSSPNPSSS--PASTGSTITISGSSSI 556

 Score = 47 (21.6 bits), Expect = 1.9e-05, Sum P(3) = 1.9e-05
 Identities = 15/37 (40%), Positives = 17/37 (45%)

Query:   210 TVPSG-GRSPKPMNGPTSTSQLKPGSDGAQNSASFPS 245
             T PS  G S  P+    STS L   S  A +S S  S
Sbjct:   231 TSPSSLGTSSSPLPSSISTSALPIASSSASSSPSAAS 267

 Score = 43 (20.2 bits), Expect = 4.6e-05, Sum P(3) = 4.6e-05
 Identities = 8/19 (42%), Positives = 11/19 (57%)

Query:   209 ATVPSGGRSPKPMNGPTST 227
             +T P+   SP P + P ST
Sbjct:   526 STSPAPSSSPNPSSSPAST 544

 Score = 40 (19.1 bits), Expect = 9.2e-05, Sum P(3) = 9.2e-05
 Identities = 12/48 (25%), Positives = 18/48 (37%)

Query:   211 VPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQS 258
             + S   S  P    ++T  +   S    +S +FPS V       G  S
Sbjct:   254 IASSSASSSPSAASSTTPVVLSSSTIQSSSGTFPSSVASSPSTVGSTS 301


>TAIR|locus:504955937 [details] [associations]
            symbol:AT2G22795 "AT2G22795" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0008150
            "biological_process" evidence=ND] [GO:0005794 "Golgi apparatus"
            evidence=IDA] [GO:0005768 "endosome" evidence=IDA] [GO:0005802
            "trans-Golgi network" evidence=IDA] GO:GO:0005794 EMBL:CP002685
            GO:GO:0005768 GO:GO:0005802 IPI:IPI00534353 RefSeq:NP_850032.1
            UniGene:At.45724 UniGene:At.74141 PRIDE:F4IKG5
            EnsemblPlants:AT2G22795.1 GeneID:816810 KEGG:ath:AT2G22795
            OMA:ESTCRKT Uniprot:F4IKG5
        Length = 734

 Score = 141 (54.7 bits), Expect = 3.3e-05, Sum P(2) = 3.3e-05
 Identities = 109/556 (19%), Positives = 215/556 (38%)

Query:   657 QNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISS---NMNSEPWQSN 713
             +    G  E+   + +  S   + E+++       EK+    +++     N  +E  + +
Sbjct:   167 ERKDNGGTEENEKSGTEESE--VEERKDNGGTEENEKSGTEESEVEERKENGGTEENEKS 224

Query:   714 VNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLH---ES 770
              ++++     + +G      +E   K+G      VE+ +DN      E ++ K +   + 
Sbjct:   225 GSEESEVEEKKDNGGT----EESREKSGTE-ESEVEEKKDNGSSEESEVEEKKENRGIDE 279

Query:   771 SFSSMNALIESCVKYSEANVSAPAGDDIGMNLL--ASVAAGEMSKSDVVSPVGSPPRTPI 828
             S  S    I+      EA  +   GDD    ++  +     E   S+ V          +
Sbjct:   280 SEESKEKDIDEKANIEEARENNYKGDDASSEVVHESEEKTSESENSEKVEDKSGIKTEEV 339

Query:   829 HE-------PLCDDN-----DSRVKSFPGDHSTDSTDD-EHEKQGIDRN-LWAKN-SDSN 873
              +       P   DN     D +       H +DS +  + E + +++N L  K  +DSN
Sbjct:   340 EDSVIKSVLPNTTDNGESSSDEKSTGSSSGHESDSLEGIKSEGESMEKNELLEKEFNDSN 399

Query:   874 QD-----KPAG-GLTGHISTSPVDLQQSGDPCQENTENSKEIIVAEETPDGAGRNPEEDK 927
              +     K  G G  G   TS V  Q+     +  T++ +E    EE+ D      E+++
Sbjct:   400 GESSVTGKSTGSGDGGSQETSEVSSQEESKGKESETKDKEESSSQEESKDRETETKEKEE 459

Query:   928 AGFRVDA-DGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDGENKKGVS 986
             +  + +  D   + K+++    S+++K  +     ETE +E S   ++ E + E K+   
Sbjct:   460 SSSQEETMDKETEAKEKVES--SSQEKNEDK----ETEKIESSFLEETKEKEDETKE--K 511

Query:   987 EGLNSGVKREQKPSPITTHSESVKGKDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKS 1046
             E  +S  K E+K +    + ES   ++ +        E +                 +K 
Sbjct:   512 EESSSQEKTEEKETETKDNEESSSQEETK----DKENEKIEKEEASSQEESKENETETKE 567

Query:  1047 HVNQTEEQNSEWKSNAPMIREDRVVPHLGSAENEEKGNGKVDHRENLEGKEVKEELCAGP 1106
                 + ++ ++ K N  + +E+   P     E +EK N K++  E+   +E KE+     
Sbjct:   568 KEESSSQEETKEKENEKIEKEESA-PQ---EETKEKENEKIEKEESASQEETKEKETETK 623

Query:  1107 ALPEVSTALRAQETGQLVRTGAVKLT-ISEGDKAQEXXXXXXXXXXXXVGVSDMEAKVEF 1165
                E S++  +QE    V T + K   + E +K  +              VSD E K   
Sbjct:   624 EKEE-SSSNESQEN---VNTESEKKEQVEENEKKTDEDTSESSKENS---VSDTEQKQSE 676

Query:  1166 DLNEGFDGDDGKYGES 1181
             + +E    +  K GE+
Sbjct:   677 ETSE--KEESNKNGET 690

 Score = 49 (22.3 bits), Expect = 3.3e-05, Sum P(2) = 3.3e-05
 Identities = 23/86 (26%), Positives = 40/86 (46%)

Query:   234 SDGAQNSASFPSQVKGKKRERGDQSSE-PVKRERSSKMEDGNSGHSRTETNLKTEIAKIT 292
             S+G  ++    S+++ KK   G + SE   KR+     E+     S TE   ++E+ +  
Sbjct:   115 SEGIVSNEDSNSEIEEKKDSGGVEESEVEEKRDNGGGTEENEK--SGTE---ESEVEERK 169

Query:   293 EKGGLVDYD--GVEKLVQLMVPERND 316
             + GG  + +  G E   +  V ER D
Sbjct:   170 DNGGTEENEKSGTE---ESEVEERKD 192


>DICTYBASE|DDB_G0271670 [details] [associations]
            symbol:DDB_G0271670 species:44689 "Dictyostelium
            discoideum" [GO:0005576 "extracellular region" evidence=IEA]
            dictyBase:DDB_G0271670 GO:GO:0005576 EMBL:AAFI02000006
            ProtClustDB:CLSZ2431310 RefSeq:XP_645495.1
            ProteinModelPortal:Q75JC9 EnsemblProtists:DDB0168484 GeneID:8618123
            KEGG:ddi:DDB_G0271670 OMA:EITNEEP Uniprot:Q75JC9
        Length = 374

 Score = 134 (52.2 bits), Expect = 3.5e-05, P = 3.5e-05
 Identities = 48/295 (16%), Positives = 115/295 (38%)

Query:   449 SGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTD 508
             S +  + +S+  +  S  +  +SS + +       + + S+ SS  S  S+   +S S+ 
Sbjct:    80 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 139

Query:   509 AKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSS 568
             +      +++S++ ++   S+ +                      + +   S    + SS
Sbjct:   140 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 199

Query:   569 ATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLTC 628
             ++ S + +  S  SS S  S +   SS+ +     + SS +++   +S+S   S  S + 
Sbjct:   200 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 259

Query:   629 EKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDR 688
               +        ++        +   S + +SS  S   +S ++S +SS       +    
Sbjct:   260 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSS 319

Query:   689 NFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDN 743
             +    + +  +  SS+ +S    S+ +  +++     S S +       S +G+N
Sbjct:   320 SSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSSGEN 374


>FB|FBgn0036181 [details] [associations]
            symbol:Muc68Ca "Mucin 68Ca" species:7227 "Drosophila
            melanogaster" [GO:0005201 "extracellular matrix structural
            constituent" evidence=ISM] [GO:0031012 "extracellular matrix"
            evidence=ISM] EMBL:AE014296 eggNOG:NOG12793 GO:GO:0031012
            GO:GO:0005201 OrthoDB:EOG47SQVR GeneTree:ENSGT00700000104174
            RefSeq:NP_996054.1 UniGene:Dm.19505 STRING:Q7KUH2 PRIDE:Q7KUH2
            EnsemblMetazoa:FBtr0076140 GeneID:2768980 KEGG:dme:Dmel_CG18331
            UCSC:CG18331-RA CTD:2768980 FlyBase:FBgn0036181 InParanoid:Q7KUH2
            OMA:SDEGQTT GenomeRNAi:2768980 NextBio:848925 ArrayExpress:Q7KUH2
            Bgee:Q7KUH2 Uniprot:Q7KUH2
        Length = 3135

 Score = 131 (51.2 bits), Expect = 5.3e-05, Sum P(3) = 5.3e-05
 Identities = 92/425 (21%), Positives = 153/425 (36%)

Query:   449 SGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPT---KSAFSSPVSVKSAPLPASG 505
             + + Q+ +ST     S   Q  SS    V++ QG +      S  SS  +  +    + G
Sbjct:  2179 NSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGSSSNGDGNSTQSSTTTTTTTTTSSDG 2238

Query:   506 STDAKDGQPRNAASATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTG-GLSGKED 564
                     P    S     D  ST +                       +   G S   D
Sbjct:  2239 GESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGSSSNGD 2298

Query:   565 ARSSATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQP 624
               S+ + S T    +  SS   +S     SS P     +  +  N++   +S++    + 
Sbjct:  2299 GNSTQS-STTTTTTTTTSSDGGESTT---SSDPVVEVSQGTNGDNSSSQSSSSTTTTKEV 2354

Query:   625 SLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQN 684
             SL   ++              ++IPN GR    NSS  S   T+VT+S +S P    K +
Sbjct:  2355 SLKDNRSPKWNRTTKTYSSRTIRIPNSGRK--LNSSS-SETSTTVTSSSSSKP--QTKYS 2409

Query:   685 QFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGS--PAVLPDEQGSKTGD 742
                 + K  N        +   ++  + N N  +    +E S S   A +   QG+   D
Sbjct:  2410 WSSSSKKSNNGGKNKKYWTKRWTKKSRKNNNGSSTIVGEESSDSLTDAGVDVTQGNGLND 2469

Query:   743 NCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNL 802
                   +    +SLP      DV+  ESS +S     E+  KYS  +   P  +  G + 
Sbjct:  2470 EGNSS-QSTVTSSLPVVDTSADVQNSESSLTST----ENTTKYSSKSFKVPKSN--GQS- 2521

Query:   803 LASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEK---Q 859
               S++A + +K+ V +   S P          ++   VK+     +T S+D        Q
Sbjct:  2522 --SISASKTTKT-VTTSTSSTPNVKSSSKKTSNSGKSVKTSSTTITTTSSDPGQSSSITQ 2578

Query:   860 GIDRN 864
             GI +N
Sbjct:  2579 GIPQN 2583

 Score = 73 (30.8 bits), Expect = 5.3e-05, Sum P(3) = 5.3e-05
 Identities = 23/79 (29%), Positives = 33/79 (41%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   714 TSSDGGQSTTSSDPVVEASQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVSEVAQGS 773

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S + DGNS  S T T   T
Sbjct:   774 SSIGDGNSTQSSTTTTTTT 792

 Score = 73 (30.8 bits), Expect = 5.3e-05, Sum P(3) = 5.3e-05
 Identities = 23/79 (29%), Positives = 33/79 (41%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   554 TSSDGGQSTTSSDPVVEASQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVSEVAQGS 613

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S + DGNS  S T T   T
Sbjct:   614 SSIGDGNSTQSSTTTTTTT 632

 Score = 70 (29.7 bits), Expect = 0.00010, Sum P(3) = 0.00010
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   954 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVSEVAQGS 1013

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1014 SSTGDGNSTQSSTTTTTTT 1032

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVK--RERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS PV    + S
Sbjct:  1354 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQATSSSAPVVDISQGS 1413

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T + T
Sbjct:  1414 SSNGDGNSTQSSTTTTITT 1432

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   874 TSSDGGQSTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 933

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   934 SSNGDGNSTQSSTTTTTTT 952

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   794 TSSDGGQSTTSSDPVVEASQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 853

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   854 SSNGDGNSTQSSTTTTTTT 872

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   634 TSSDGGQSTTSSDPVVEASQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 693

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   694 SSNGDGNSTQSSTTTTTTT 712

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   474 TSSDGGQSTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 533

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   534 SSNGDGNSTQSSTTTTTTT 552

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   394 TSSDGGQSTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 453

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   454 SSNGDGNSTQSSTTTTTTT 472

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   314 TSSDGGQSTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 373

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   374 SSNGDGNSTQSSTTTTTTT 392

 Score = 68 (29.0 bits), Expect = 0.00016, Sum P(3) = 0.00016
 Identities = 23/79 (29%), Positives = 32/79 (40%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG+S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   234 TSSDGGQSTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 293

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   294 SSNGDGNSTQSSTTTTTTT 312

 Score = 67 (28.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:  2074 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 2133

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  2134 SSNGDGNSTQSSTTTTTTT 2152

 Score = 67 (28.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:  1994 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 2053

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  2054 SSNGDGNSTQSSTTTTTTT 2072

 Score = 67 (28.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:  1914 TSSDGGESTTSSDPVVEVSQGTNGDNNSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 1973

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1974 SSNGDGNSTQSSTTTTTTT 1992

 Score = 67 (28.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:  1274 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 1333

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1334 SSNGDGNSTQSSTTTTTTT 1352

 Score = 67 (28.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:  1194 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 1253

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1254 SSNGDGNSTQSSTTTTTTT 1272

 Score = 67 (28.6 bits), Expect = 0.00021, Sum P(3) = 0.00021
 Identities = 23/79 (29%), Positives = 31/79 (39%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:  1034 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSDPVSEVAQGS 1093

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1094 SLNGDGNSTQSSTTTTTTT 1112

 Score = 65 (27.9 bits), Expect = 0.00032, Sum P(3) = 0.00032
 Identities = 23/79 (29%), Positives = 30/79 (37%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVK--RERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS PV    + S
Sbjct:  1514 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQATSSSAPVVDISQGS 1573

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1574 SSNGDGNSTQSSTTTTTTT 1592

 Score = 64 (27.6 bits), Expect = 0.00041, Sum P(3) = 0.00041
 Identities = 23/79 (29%), Positives = 30/79 (37%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS PV    + S
Sbjct:  1674 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSAPVVEVTQGS 1733

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1734 SSNGDGNSTQSSTTTTTTT 1752

 Score = 64 (27.6 bits), Expect = 0.00041, Sum P(3) = 0.00041
 Identities = 23/79 (29%), Positives = 30/79 (37%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS PV    + S
Sbjct:  1114 TSSDGGESTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSAPVVEVTQGS 1173

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1174 SSNGDGNSTQSSTTTTTTT 1192

 Score = 63 (27.2 bits), Expect = 0.00051, Sum P(3) = 0.00051
 Identities = 23/79 (29%), Positives = 30/79 (37%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVK--RERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS PV    + S
Sbjct:  1834 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSAPVVDISQGS 1893

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1894 SSNGDGNSTQSSTTTTTTT 1912

 Score = 63 (27.2 bits), Expect = 0.00051, Sum P(3) = 0.00051
 Identities = 23/79 (29%), Positives = 30/79 (37%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVK--RERS 267
             T   GG S    +     SQ   G + +  S+S  +        +   SS PV    + S
Sbjct:  1754 TSSDGGESTTSSDPVVEVSQGTNGDNSSTQSSSSTTTTTSSDEGQTTSSSAPVVDISQGS 1813

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:  1814 SSNGDGNSTQSSTTTTTTT 1832

 Score = 61 (26.5 bits), Expect = 0.00080, Sum P(3) = 0.00080
 Identities = 22/79 (27%), Positives = 30/79 (37%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR--ERS 267
             T   GG      +     SQ   G + +  S+S  +        +   SS+PV    + S
Sbjct:   154 TSSDGGEFTTSSDPVVEVSQGTNGGNSSTQSSSSTTTTTSSDEGQTTSSSDPVVEVAQGS 213

Query:   268 SKMEDGNSGHSRTETNLKT 286
             S   DGNS  S T T   T
Sbjct:   214 SSNGDGNSTQSSTTTTTTT 232

 Score = 61 (26.5 bits), Expect = 0.00080, Sum P(3) = 0.00080
 Identities = 23/90 (25%), Positives = 39/90 (43%)

Query:   213 SGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSKMED 272
             S G      +  T+T+     SDG Q++ S    V+  +   GD SS    +  SS    
Sbjct:  1575 SNGDGNSTQSSTTTTTTTTTSSDGDQSTTSSDPVVEVSQGTNGDNSST---QSSSSTTTT 1631

Query:   273 GNSGHSRTETNLKTEIAKITEKGGLVDYDG 302
              +S   +T T+    + ++T+ G   + DG
Sbjct:  1632 TSSDEGQT-TSSSAPVVEVTQ-GSSSNGDG 1659

 Score = 61 (26.5 bits), Expect = 0.00080, Sum P(3) = 0.00080
 Identities = 23/98 (23%), Positives = 42/98 (42%)

Query:   205 IEMHATVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKR 264
             +E+     S G      +  T+T+     SDG +++ S    V+  +   GD SS    +
Sbjct:  1647 VEVTQGSSSNGDGNSTQSSTTTTTTTTTSSDGGESTTSSDPVVEVSQGTNGDNSST---Q 1703

Query:   265 ERSSKMEDGNSGHSRTETNLKTEIAKITEKGGLVDYDG 302
               SS     +S   +T T+    + ++T+ G   + DG
Sbjct:  1704 SSSSTTTTTSSDEGQT-TSSSAPVVEVTQ-GSSSNGDG 1739

 Score = 51 (23.0 bits), Expect = 5.3e-05, Sum P(3) = 5.3e-05
 Identities = 31/137 (22%), Positives = 57/137 (41%)

Query:  1235 KVELGWKGSAATSAFRPAEPRKILEMPLGATSISVPDSTSGKLGRPLLDIDLNVPDERVL 1294
             K    +K +  T+  + +   K  E     +S S   S   +L  P ++  ++V      
Sbjct:  2914 KASSSFKTTTTTTTSKTSSVPKT-ESKYSWSSSSKKTSNPIRLTLPNINAGISVGGG--- 2969

Query:  1295 EDLASRSSVQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTS 1354
             +   S S +    T+SD TN  DG       S S+ G+ G   D + ++     G+ S++
Sbjct:  2970 DSSGSWSKLIKRSTSSDETNASDGPSL----SGSIVGAGGSQSDESWSQRSGFSGDSSSA 3025

Query:  1355 NGNKIDVPVQPGTSSGG 1371
              G+  D+ ++ G    G
Sbjct:  3026 QGSP-DIRIRLGREQSG 3041

 Score = 39 (18.8 bits), Expect = 0.00080, Sum P(3) = 0.00080
 Identities = 27/128 (21%), Positives = 51/128 (39%)

Query:  1243 SAATSAFRPAEPRKILEMPLGATSISVPDSTSGKLGRPLLDIDLNVPDERVLEDLASRSS 1302
             +  T++   ++  KIL +P     IS+    SG      + I     +  V +  +S  +
Sbjct:  2864 TTTTTSTNGSKSSKILTVPKVDAGISIDGGISGSTSTKTIKITSK--NSAVPKASSSFKT 2921

Query:  1303 VQDTVTASDHTNNRDGSRCEVMGSKSVRGSVGLDLDLNRAEELIDIGNYSTSNG-NKIDV 1361
                T T+   +  +  S+     S S + S  + L L      I +G   +S   +K+  
Sbjct:  2922 TTTTTTSKTSSVPKTESKYS-WSSSSKKTSNPIRLTLPNINAGISVGGGDSSGSWSKL-- 2978

Query:  1362 PVQPGTSS 1369
              ++  TSS
Sbjct:  2979 -IKRSTSS 2985


>UNIPROTKB|Q5JRI8 [details] [associations]
            symbol:TCEA2 "Transcription elongation factor A protein 2"
            species:9606 "Homo sapiens" [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=IEA] [GO:0006351 "transcription,
            DNA-dependent" evidence=IEA] [GO:0045944 "positive regulation of
            transcription from RNA polymerase II promoter" evidence=IEA]
            InterPro:IPR017923 Pfam:PF08711 PROSITE:PS51319 GO:GO:0005634
            GO:GO:0003677 GO:GO:0006351 GO:GO:0003746 Gene3D:1.20.930.10
            SUPFAM:SSF47676 EMBL:AL590548 EMBL:AL355803 UniGene:Hs.505004
            UniGene:Hs.736326 HGNC:HGNC:11614 IPI:IPI00384330 SMR:Q5JRI8
            STRING:Q5JRI8 Ensembl:ENST00000415602 UCSC:uc010gku.2
            Uniprot:Q5JRI8
        Length = 82

 Score = 107 (42.7 bits), Expect = 0.00014, P = 0.00014
 Identities = 25/60 (41%), Positives = 37/60 (61%)

Query:   384 LLRALDKLPVNLNALQMCNIGKSVNHLRTHKN-VEIQKKARSLVDTWKKRVEAEMDARPR 442
             LLR L  +P+ L+ LQ   +G SVN LR   +  E+   A+SL+ +WKK ++A  DA+ R
Sbjct:     3 LLRELKAMPITLHLLQSTRVGMSVNALRKQSSDEEVIALAKSLIKSWKKLLDAS-DAKAR 61


>MGI|MGI:97309 [details] [associations]
            symbol:Nefh "neurofilament, heavy polypeptide" species:10090
            "Mus musculus" [GO:0000226 "microtubule cytoskeleton organization"
            evidence=IGI] [GO:0005739 "mitochondrion" evidence=IDA] [GO:0005882
            "intermediate filament" evidence=IEA] [GO:0005883 "neurofilament"
            evidence=IDA] [GO:0030424 "axon" evidence=IDA] [GO:0045104
            "intermediate filament cytoskeleton organization" evidence=IMP]
            [GO:0045110 "intermediate filament bundle assembly" evidence=IGI]
            [GO:0045735 "nutrient reservoir activity" evidence=IEA] [GO:0048936
            "peripheral nervous system neuron axonogenesis" evidence=IMP]
            [GO:0060052 "neurofilament cytoskeleton organization"
            evidence=IGI;IMP] InterPro:IPR001419 Pfam:PF03157 MGI:MGI:97309
            GO:GO:0045735 GO:GO:0005739 GO:GO:0000226 eggNOG:NOG12793
            GeneTree:ENSGT00690000102043 HOGENOM:HOG000230977
            HOVERGEN:HBG013015 GO:GO:0005883 GO:GO:0060052 InterPro:IPR016044
            InterPro:IPR018039 Pfam:PF00038 PROSITE:PS00226 EMBL:AL645522
            GO:GO:0045110 CTD:4744 KO:K04574 OrthoDB:EOG4NS3BW
            InterPro:IPR010790 Pfam:PF07142 EMBL:M35131 EMBL:M24496 EMBL:M23349
            EMBL:M24494 EMBL:M24495 EMBL:Z31012 EMBL:EF101556 IPI:IPI00114241
            PIR:JT0368 RefSeq:NP_035034.2 UniGene:Mm.298283 DisProt:DP00050
            ProteinModelPortal:P19246 SMR:P19246 IntAct:P19246 STRING:P19246
            PhosphoSite:P19246 UCD-2DPAGE:P19246 PaxDb:P19246 PRIDE:P19246
            DNASU:380684 Ensembl:ENSMUST00000093369 GeneID:380684
            KEGG:mmu:380684 UCSC:uc007hvm.1 InParanoid:P19246 OMA:EAKSPGE
            NextBio:401100 Bgee:P19246 CleanEx:MM_NEFH Genevestigator:P19246
            GermOnline:ENSMUSG00000020396 Uniprot:P19246
        Length = 1090

 Score = 145 (56.1 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 117/575 (20%), Positives = 199/575 (34%)

Query:   469 PASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAAS--ATGTTDL 526
             PA +K+       G+A +     SP   KS   P S +      +P++ A+  + G    
Sbjct:   536 PAEAKSPGEAKSPGEAKSPGEAKSPAEPKSPAEPKSPAEAKSPAEPKSPATVKSPGEAKS 595

Query:   527 PSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGG-LSGKEDARSSATVSMTLNKISVGSSRS 585
             PS      +                  AK+        +A+S ATV       S G ++S
Sbjct:   596 PSEAKSPAEAKSPAEAKSPAEAKSPAEAKSPAEAKSPAEAKSPATVK------SPGEAKS 649

Query:   586 RKSVNG-YPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLTCEKALDVPV---VEGAN 641
                      + +PA  +    +   A +     +  P++P    E      V    E  +
Sbjct:   650 PSEAKSPAEAKSPAEAKSPAEAKSPAEVKSPGEAKSPAEPKSPAEAKSPAEVKSPAEAKS 709

Query:   642 PKII-----------VKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNF 690
             P  +           VK P   +SPA   S G  +      S A +    E ++  +   
Sbjct:   710 PAEVKSPGEAKSPAAVKSPAEAKSPAAVKSPGEAKSPGEAKSPAEAKSPAEAKSPIEVKS 769

Query:   691 KEK-NDALRADISSNMNS---EPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRK 746
              EK    ++    S   +   E  +S V +D     +  S   A  P ++G+K  +  + 
Sbjct:   770 PEKAKTPVKEGAKSPAEAKSPEKAKSPVKEDIKPPAEAKSPEKAKSPVKEGAKPPEKAKP 829

Query:   747 VVEDLEDNSLPPGYEFK---DVKLHESSFSSMNALIESCVKYSEANVS---APAGDDIGM 800
             +     +   P   E K   D++  E   S      +S  K  EA  S   AP  +++  
Sbjct:   830 LDVKSPEAQTPVQEEAKHPTDIRPPEQVKSPAKEKAKSPEK-EEAKTSEKVAPKKEEVKS 888

Query:   801 NLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEKQG 860
              +   V A E  K          P+T   E      D   K  P     +  +   EK  
Sbjct:   889 PVKEEVKAKEPPKKVEEEKTLPTPKTEAKE---SKKDEAPKEAPKPKVEEKKETPTEKPK 945

Query:   861 IDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQENTENSKEIIVAEETPDGAG 920
              D    AK  ++ + K A   +   + + + +++   P +E TE +K    AE+T     
Sbjct:   946 -DSTAEAKKEEAGEKKKAVA-SEEETPAKLGVKEEAKP-KEKTETTKT--EAEDTKAKEP 1000

Query:   921 RNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDGE 980
               P E +   + +   AP+ K       + E+K +ES +  E   +E            E
Sbjct:  1001 SKPTETEKPKKEEMPAAPEKKD------TKEEKTTESRKPEEKPKMEAKVKEDDKSLSKE 1054

Query:   981 NKKGVSEGLNSGVKREQK---PSPITTHSESVKGK 1012
               K  +E        +QK   P   TT  ++ KG+
Sbjct:  1055 PSKPKTEKAEKSSSTDQKESQPPEKTTEDKATKGE 1089

 Score = 43 (20.2 bits), Expect = 0.00014, Sum P(2) = 0.00014
 Identities = 14/44 (31%), Positives = 19/44 (43%)

Query:   482 GDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPRNAASATGTTD 525
             G   T+SA  S     S    +  S  A   + R AAS+T + D
Sbjct:    33 GPGGTRSAAGSSSGFHSWARTSVSSVSASPSRFRGAASSTDSLD 76

 Score = 42 (19.8 bits), Expect = 0.00017, Sum P(2) = 0.00017
 Identities = 18/56 (32%), Positives = 28/56 (50%)

Query:   416 VEIQKKARSLVDT--WKKRVEAEMDARPRLPEVPHSGNRQTGASTEV--AIKSLVT 467
             VE+QKKA++L +   + +R   E +    L ++   G  Q  A  E   A+K  VT
Sbjct:   212 VELQKKAQALQEECGYLRRHHQE-EVGELLGQIQGCGAAQAQAQAEARDALKCDVT 266


>DICTYBASE|DDB_G0289357 [details] [associations]
            symbol:DDB_G0289357 species:44689 "Dictyostelium
            discoideum" [GO:0016021 "integral to membrane" evidence=IEA]
            [GO:0016020 "membrane" evidence=IEA] [GO:0044351 "macropinocytosis"
            evidence=RCA] dictyBase:DDB_G0289357 GO:GO:0016021
            EMBL:AAFI02000139 RefSeq:XP_636279.1 EnsemblProtists:DDB0188381
            GeneID:8627097 KEGG:ddi:DDB_G0289357 OMA:KQSWSEL Uniprot:Q54HM2
        Length = 556

 Score = 103 (41.3 bits), Expect = 0.00018, Sum P(2) = 0.00018
 Identities = 27/96 (28%), Positives = 44/96 (45%)

Query:   185 DQDYINERQEEVDQLLYKTHIEMHATVPSGGRSPKPMNG----PTSTSQLKPGSDGAQNS 240
             D D   E +E     LY  H+E   T      S    N     PT+ S L P ++ + NS
Sbjct:    56 DDDEEEEEEESSWDTLYAKHVETSNTTQPFNNSNSNNNNFQTQPTNISTLNPNNNNSNNS 115

Query:   241 ASFPSQVKGKKRERGDQSSEPVKRERSSKMEDGNSG 276
             +S  S  +G +  RG +S+ P +  ++  ++  +SG
Sbjct:   116 SSGSSSSRGVRTPRGTRSNSPPQPSKNETVQKESSG 151

 Score = 78 (32.5 bits), Expect = 0.00018, Sum P(2) = 0.00018
 Identities = 76/424 (17%), Positives = 152/424 (35%)

Query:   606 SSKNATLHRNSASDRPSQPSL--TCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGS 663
             SS+     R + S+ P QPS   T +K     + EG     ++  PN       N+   +
Sbjct:   121 SSRGVRTPRGTRSNSPPQPSKNETVQKESSGDISEGFT---LIDSPNDNNDNKNNNKNNN 177

Query:   664 VEDTSVTNSRASSPVLP--EKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAAC 721
               D+++ +        P   K+NQ  R  K+   +  +  S  ++ +   S +       
Sbjct:   178 -NDSNIVDDDEDEEEFPTLSKKNQ-KRKPKKSTSSPSSTSSPIVSPQTQTSKLESSMDVS 235

Query:   722 PDEGSGSPAVL----PDEQ-----GSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSF 772
             P  G  S + L     DE       +   +N  +  ++ E+      Y+     +  +S 
Sbjct:   236 PSSGKQSWSELLKNVADEDINNNNNNNNNNNSNQYHQEEENYYDSDDYDSSPFAIINNSS 295

Query:   773 SSMNALIESCVKYSEANVSAPAGDDIGMNLLASVA--AGEMSKSDVVSPVGSPPRT---P 827
             ++ N    +    +    +    +   + ++ S +   GE   SD+   +G  P+T   P
Sbjct:   296 TTTNNNNNNNNNTTTTTTTTTTTNSSSLPIVNSQSFEEGEEITSDI--KIGIKPKTVTVP 353

Query:   828 IHEPLC-DDNDSRVKSFPGDH--STDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGH 884
                 L       ++K        S+ S  + +  + +D N +A   +  +   +      
Sbjct:   354 FQSTLSLRARTKQIKKVQQQQQQSSKSKPNNNNNKFVDNNPYAVLEEEERALQSAIKASL 413

Query:   885 ISTSPVDLQQSGDPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRI 944
             +  SPVDL    D  Q+N    K+    E+ P     +    K+      D        +
Sbjct:   414 LLNSPVDL----DSKQQNVSQQKQ--QQEQQPTTTTNSVSSSKSKSVATTDKNRTTSTAV 467

Query:   945 SGPLSTEDKVSESTRGVETEAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITT 1004
             +   S+  K ++S +   T     +A+  +     +NK   ++  +S V      +  T 
Sbjct:   468 APTTSSNKKANKSNK---TSTANTTATTTTTASSKKNKSNSNK--SSNVSNTTTTTSTTE 522

Query:  1005 HSES 1008
             +S S
Sbjct:   523 NSAS 526


>DICTYBASE|DDB_G0295719 [details] [associations]
            symbol:DDB_G0295719 "unknown" species:44689
            "Dictyostelium discoideum" [GO:0008150 "biological_process"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0295719 EMBL:AAFI02000070 KO:K10867
            RefSeq:XP_002649141.1 EnsemblProtists:DDB0252814 GeneID:8624735
            KEGG:ddi:DDB_G0295719 OMA:ILITSEF Uniprot:C7G031
        Length = 616

 Score = 130 (50.8 bits), Expect = 0.00022, P = 0.00022
 Identities = 106/578 (18%), Positives = 200/578 (34%)

Query:   451 NRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPA-SGSTDA 509
             N   G  + VA     T P+SS+   V   Q D    SA       +    PA S +  +
Sbjct:    22 NVSNGQDSSVAQSENPT-PSSSEQQVVPSSQEDQTASSAVPEQTDSQQQEAPAQSSAAPS 80

Query:   510 KDGQPRNAASATGTTDLP-STPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSS 568
              D Q     S    T      PA  E                 +  +    S   + ++ 
Sbjct:    81 TDSQQAPTESTEAPTQSSVEQPAPTESTETPAQSSAAEQQAPAESTEAPAQSSAAEQQAP 140

Query:   569 ATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLTC 628
             A  +    + S    ++       P+ + A  Q+    S  A   ++SA+++ + P+ + 
Sbjct:   141 AESTEAPAQSSAAEQQAPAESTEAPAQSSAAEQQAPAESTEAPA-QSSAAEQQA-PAEST 198

Query:   629 EKALDVPVVEGANPKIIVKIPNRGRSPAQNS-SGGSVEDTSVTNSRA--SSPVLPEKQNQ 685
             E        E   P    + P +  +  Q++ S  +   +S    +A   S   P + + 
Sbjct:   199 EAPAQSSAAEQQAPAESNEAPAQSSAAEQSAESNEAPAQSSAAEQQAPAESNEAPAQSSA 258

Query:   686 FDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCR 745
              ++   E N+A  A  SS +   P +S+   +A   P E +  P    + +  +T D+ +
Sbjct:   259 AEQP-AESNEA-PAQSSSTVEQPPAESSAAPEATEVPAESAEQPTESSNAEQQQT-DSQQ 315

Query:   746 KVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLLAS 805
                   E+   P   + +     + +  S           ++++      +         
Sbjct:   316 PTQSSGEEQQQPTDSQQQPTDSQQQTTDSQQQQTSESSNPTQSSGEQQPTESSNAEQPTD 375

Query:   806 VAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEKQGIDRNL 865
                     S       SP ++   +P   ++     + P + +  S+ +E ++Q  D   
Sbjct:   376 SQQPPAESSSAPEATESPEQSGEEQPT--ESQQETSASPTEENQTSSTEE-QQQPAD--- 429

Query:   866 WAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQENTENSKEIIVAEETPDGAGRNPEE 925
                 S ++ ++PA   +    TS    Q +  P  E++ +S     +  TP  +   P E
Sbjct:   430 ----STASVEQPAETSSSQ-QTSEAPAQSNEQPT-ESSASSNPTESSSATPTESSTAPTE 483

Query:   926 DKAGFRVDADGAPDGKQR-ISGPLSTEDKVSESTRGVETEAVEGSASNQSLEF-DGENKK 983
               A     +    +  +  +  P S ED  SE+    ETE  E S  ++  E  + E   
Sbjct:   484 SSATPAESSTATSESSEETVVKPASGEDG-SEAIESGETETSEESIESEETETSNSETSP 542

Query:   984 GVSEG-LNSGVKREQKPSPITTHSESVKGKDGELLHTS 1020
              +S+  +NS                S KG     LH+S
Sbjct:   543 AISKSPINSDPTLPNINEFSDDEKASQKGGKQNKLHSS 580


>SGD|S000001458 [details] [associations]
            symbol:FLO11 "GPI-anchored cell surface glycoprotein
            (flocculin)" species:4932 "Saccharomyces cerevisiae" [GO:0005576
            "extracellular region" evidence=IEA;IDA] [GO:0005935 "cellular bud
            neck" evidence=IDA] [GO:0016337 "cell-cell adhesion" evidence=IMP]
            [GO:0031225 "anchored to membrane" evidence=IEA] [GO:0016020
            "membrane" evidence=IEA] [GO:0005618 "cell wall" evidence=IEA]
            [GO:0001403 "invasive growth in response to glucose limitation"
            evidence=IMP] [GO:0007124 "pseudohyphal growth" evidence=IMP]
            [GO:0005886 "plasma membrane" evidence=IDA] [GO:0003674
            "molecular_function" evidence=ND] [GO:0030447 "filamentous growth"
            evidence=IDA] [GO:0000128 "flocculation" evidence=IMP] [GO:0042710
            "biofilm formation" evidence=IMP] SGD:S000001458 GO:GO:0005886
            GO:GO:0005618 GO:GO:0005576 GO:GO:0005935 GO:GO:0031225
            EMBL:BK006942 GO:GO:0016337 eggNOG:NOG12793 GO:GO:0001403
            EMBL:Z38061 GO:GO:0042710 KO:K01178 GO:GO:0007124 OrthoDB:EOG4BS0WG
            InterPro:IPR018789 Pfam:PF10182 GO:GO:0000128 KO:K14819
            RefSeq:NP_012284.3 GeneID:854836 KEGG:sce:YIR019C EMBL:M16164
            EMBL:M16165 EMBL:X13857 PIR:S48478 RefSeq:NP_012292.3
            ProteinModelPortal:P08640 IntAct:P08640 MINT:MINT-2782681
            STRING:P08640 PeptideAtlas:P08640 EnsemblFungi:YIR019C
            GeneID:854844 KEGG:sce:YIR026C CYGD:YIR019c
            GeneTree:ENSGT00700000104174 OMA:ADQFTYV NextBio:977713
            Genevestigator:P08640 GermOnline:YIR019C Uniprot:P08640
        Length = 1367

 Score = 143 (55.4 bits), Expect = 0.00023, Sum P(2) = 0.00023
 Identities = 113/590 (19%), Positives = 198/590 (33%)

Query:   436 EMDARPRLPEVPHSGNRQTGASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVS 495
             E  + P       S +    +ST  +  + V  P+SS T +       + T+S+ S+PV+
Sbjct:   365 ESSSAPVTSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESS-SAPVT 423

Query:   496 ---VKSAPLPASGSTDAKDGQPRNAASATGTTDLP-STPAKDEKXXXXXXXXXXXXXCVG 551
                 +S+  P + ST      P  + S T ++  P  TP+                    
Sbjct:   424 SSTTESSSAPVTSSTTESSSAPVTS-STTESSSAPVPTPSSSTTESSSAPVTSSTTESSS 482

Query:   552 DHAKTGGLSGKEDARSSATVSMTLNK---ISVGSSRSRKSVNGYPSSTPAGVQRETVSSK 608
                 T   S  E + +  T S T +    +   SS + +S +  P+ TP+    E+ SS 
Sbjct:   483 APVPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSA-PAPTPSSSTTES-SSA 540

Query:   609 NATLHRNSASDRP-SQPSLTCEKALDVPVVEGANPKIIVKIPNRGRSPAQNSSGGSVEDT 667
               T     +S  P   PS +  ++   PV           +P    S  ++SS      +
Sbjct:   541 PVTSSTTESSSAPVPTPSSSTTESSSTPVTSSTTESSSAPVPTPSSSTTESSSAPVPTPS 600

Query:   668 SVTNSRASSPVLPEKQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSG 727
             S T   +S+P  P   +    +      +   + SS     P  S     +A  P   S 
Sbjct:   601 SSTTESSSAPA-PTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVPTPSSS 659

Query:   728 ----SPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCV 783
                 S A +P    S T  +   V     ++S  P          ESS + +     S  
Sbjct:   660 TTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVTS----STTESSSAPVPTPSSSTT 715

Query:   784 KYSEANVSAPAGDDIGMNLLA----SVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSR 839
             + S A V  P+      +       S +  E S + V S        P+  P     +S 
Sbjct:   716 ESSSAPVPTPSSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESS 775

Query:   840 VKSFPGDHSTDSTDDEHEKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPC 899
                 P   +  S+  E     +     +    S    P    + +I++S      S  P 
Sbjct:   776 SAPVP---TPSSSTTESSSAPVPTPSSSTTESSVAPVPTPSSSSNITSSA----PSSTPF 828

Query:   900 QENTENSKEIIV--AEETPDGAGRNPEEDKAGFRVDADGAPDGKQRISGPLSTEDKVSES 957
               +TE+S   +   +  T + +            V     P     I+    +    S +
Sbjct:   829 SSSTESSSVPVPTPSSSTTESSSAPVSSSTTESSVAPVPTPSSSSNITSSAPSSIPFSST 888

Query:   958 TRGVETEAVEGSASNQSLEFDG-ENKKGVSEGLNSGVKREQKPSPITTHS 1006
             T    T     + +  S ++ G + +  VS    + +   +  + +TT S
Sbjct:   889 TESFSTGT---TVTPSSSKYPGSQTETSVSSTTETTIVPTKTTTSVTTPS 935

 Score = 134 (52.2 bits), Expect = 0.00022, P = 0.00022
 Identities = 106/545 (19%), Positives = 175/545 (32%)

Query:   456 ASTEVAIKSLVTQPASSKTGAVKLCQGDAPTKSAFSSPVSVKSAPLPASGSTDAKDGQPR 515
             +ST  +  + V  P+SS T +       + T+S+ S+PV   S+    S S         
Sbjct:   319 SSTTESSSAPVPTPSSSTTESSSAPVTSSTTESS-SAPVPTPSSSTTESSSAPVTSSTTE 377

Query:   516 NAAS--ATGTTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSSATVSM 573
             ++++   + TT+  S P                     + +     S   ++ S+   S 
Sbjct:   378 SSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVTSSTTESSSAPVTSSTTESSSAPVTSS 437

Query:   574 TLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRP-SQPSLTCEKAL 632
             T    S   + S    +  P  TP+    E+ SS   T     +S  P   PS +  ++ 
Sbjct:   438 TTESSSAPVTSSTTESSSAPVPTPSSSTTES-SSAPVTSSTTESSSAPVPTPSSSTTESS 496

Query:   633 DVPVVEGANPKIIVKIPNRGRS-------PAQNSSGGSVEDTS--VTNS--RASSPVLPE 681
               PV           +P    S       PA   S  + E +S  VT+S   +SS  +P 
Sbjct:   497 SAPVTSSTTESSSAPVPTPSSSTTESSSAPAPTPSSSTTESSSAPVTSSTTESSSAPVPT 556

Query:   682 KQNQFDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSG----SPAVLPDEQG 737
               +    +      +   + SS     P  S     +A  P   S     S A  P    
Sbjct:   557 PSSSTTESSSTPVTSSTTESSSAPVPTPSSSTTESSSAPVPTPSSSTTESSSAPAPTPSS 616

Query:   738 SKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDD 797
             S T  +   V     ++S  P          ESS + +     S  + S A V  P+   
Sbjct:   617 STTESSSAPVTSSTTESSSAP-VPTPSSSTTESSSAPVPTPSSSTTESSSAPVPTPSSST 675

Query:   798 I-GMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEH 856
                 +   + +  E S + V S        P+  P     +S     P   +  S+  E 
Sbjct:   676 TESSSAPVTSSTTESSSAPVTSSTTESSSAPVPTPSSSTTESSSAPVP---TPSSSTTES 732

Query:   857 EKQGIDRNLWAKNSDSNQDKPAGGLTGHISTSPVDLQQSGDPCQENTENSKEIIV--AEE 914
                 +     + ++  +   P    T   S++PV       P    TE+S   +   +  
Sbjct:   733 SSAPVPTP--SSSTTESSSAPVTSSTTESSSAPVPT-----PSSSTTESSSAPVPTPSSS 785

Query:   915 TPDGAGR---NPEEDKAGFRVDADGAPDGKQRI--SGPLSTEDKVSESTRGVETEAVEGS 969
             T + +      P        V     P     I  S P ST    S  +  V       S
Sbjct:   786 TTESSSAPVPTPSSSTTESSVAPVPTPSSSSNITSSAPSSTPFSSSTESSSVPVPTPSSS 845

Query:   970 ASNQS 974
              +  S
Sbjct:   846 TTESS 850

 Score = 45 (20.9 bits), Expect = 0.00023, Sum P(2) = 0.00023
 Identities = 24/86 (27%), Positives = 33/86 (38%)

Query:   210 TVPSGGRSPKPMNGPTSTSQLKPGSDGAQNSASFPSQVKGKKRERGDQSSEPVKRERSSK 269
             T  S    P P +  T +S   P +     S+S P  V     E    SS PV    SS 
Sbjct:   349 TESSSAPVPTPSSSTTESSSA-PVTSSTTESSSAP--VTSSTTE---SSSAPVPTPSSST 402

Query:   270 MEDGNSG--HSRTETNLKTEIAKITE 293
              E  ++    S TE++     +  TE
Sbjct:   403 TESSSAPVTSSTTESSSAPVTSSTTE 428


>DICTYBASE|DDB_G0293012 [details] [associations]
            symbol:DDB_G0293012 "CHR group protein" species:44689
            "Dictyostelium discoideum" [GO:0005634 "nucleus" evidence=IEA]
            [GO:0005524 "ATP binding" evidence=IEA] [GO:0004386 "helicase
            activity" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0003676 "nucleic acid binding" evidence=IEA] [GO:0006357
            "regulation of transcription from RNA polymerase II promoter"
            evidence=ISS] [GO:0004003 "ATP-dependent DNA helicase activity"
            evidence=ISS] InterPro:IPR000330 InterPro:IPR000626
            InterPro:IPR000953 InterPro:IPR001650 Pfam:PF00176 Pfam:PF00271
            PROSITE:PS50013 PROSITE:PS51194 SMART:SM00213 SMART:SM00298
            SMART:SM00490 dictyBase:DDB_G0293012 Pfam:PF00385 GO:GO:0005524
            GO:GO:0005634 GO:GO:0003677 GO:GO:0006357 GO:GO:0004003
            InterPro:IPR016197 SUPFAM:SSF54160 eggNOG:COG0553
            InterPro:IPR014001 SMART:SM00487 PROSITE:PS51192 InterPro:IPR023780
            EMBL:AAFI02000199 InterPro:IPR022617 Pfam:PF11976
            RefSeq:XP_629261.1 ProteinModelPortal:Q54CF8
            EnsemblProtists:DDB0220643 GeneID:8628981 KEGG:ddi:DDB_G0293012
            InParanoid:Q54CF8 OMA:IMINEDI Uniprot:Q54CF8
        Length = 3071

 Score = 98 (39.6 bits), Expect = 0.00035, Sum P(3) = 0.00035
 Identities = 42/194 (21%), Positives = 87/194 (44%)

Query:   825 RTPIHEPLCDDNDSRVKSFPGDHSTDSTDDEHEKQGI-------DRNLWAKNSDSNQDKP 877
             R  I EP  +D ++++K    + ST+S    ++++ +       ++ + +   D N++K 
Sbjct:  1862 RWNIEEPKSNDKETQIKVEEKEKSTESKSSSNDEKDVCEKVEKGEKEISSDKVDENKEKV 1921

Query:   878 AGGLTGHISTSPVDLQQSGDPCQ-ENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADG 936
                   ++     D ++S D  + ++ E SK+    E++ D   ++ +++K+  + D   
Sbjct:  1922 ENNEKVNVDEKSKDNEKSNDSEKSKDNEKSKD---DEKSKDDE-KSKDDEKS--QDDEKS 1975

Query:   937 APDGKQRISGPLSTEDKVSEST-RGVETEAVEGSASN-QSLEFDGENKKGVSEGLNSGVK 994
               D K +       E+   EST   V+ E+   S+S  QS+E   E +       N    
Sbjct:  1976 KDDEKSKDDEKSKLEENQDESTPMEVDKESKSTSSSEKQSIEIKEEKESEKKPQDNDNAT 2035

Query:   995 REQKPSPITTHSES 1008
                  SP TT+++S
Sbjct:  2036 ENGVKSPSTTNTDS 2049

 Score = 89 (36.4 bits), Expect = 0.00035, Sum P(3) = 0.00035
 Identities = 41/169 (24%), Positives = 67/169 (39%)

Query:   569 ATVSMTLNKISVGSSRSRKSVNGYPSSTPAGVQRETVSSKNATLHRNSASDRPSQPSLTC 628
             AT  + L+ I + +++ +K  N  P ST       T +S ++ +   S  D PS  +   
Sbjct:  1348 ATKKLLLDHIVLSTNKDKKQTNTTPISTTTSSSSTTTASSSSNVDSPSPGDTPSNSNPFK 1407

Query:   629 EKALDVPVVEGAN-PKIIVKIPNRGRSP--AQNSSGGSVEDTSVTNSRASSPVLPEKQNQ 685
               A D P+VE  N P  I ++   G +   A+ +   + +D  + N      +      Q
Sbjct:  1408 VSAGDAPLVEDPNTPSQISQMLKYGAAYLFAETTDEANKQDKIMINEDIDKILERSTTIQ 1467

Query:   686 FDRNFKEKNDALRADISSNMNSEPWQSNVNKDAAACPDEGSGSPAVLPD 734
             FD     K  +L     +   S     NV+ +     DE      VLPD
Sbjct:  1468 FDEK-NSKPSSLSGFSKATFASSETDMNVDMN-----DENFWEK-VLPD 1509

 Score = 60 (26.2 bits), Expect = 0.00035, Sum P(3) = 0.00035
 Identities = 30/97 (30%), Positives = 46/97 (47%)

Query:   184 TDQDYINERQEEVDQLL--YKTHIEMHAT-VP-SGGRSPKPMNGPTSTSQLKPGSDGAQN 239
             T  + I    EE D+    Y+T I    + VP   G S K    PT+T   K  S+ + +
Sbjct:   382 TPLNKIESESEESDESSDEYETPITTSKSKVPIKKGGSKKAAASPTTTKLDKSKSNISSS 441

Query:   240 SASFPSQVKGKKRERG-DQSSEPVKRE---RSSKMED 272
             S+S  S    KK++ G  +S  P+K +   ++S  ED
Sbjct:   442 SSS--SNSNNKKKQLGVTKSRVPLKGKGKIKNSSSED 476


>UNIPROTKB|F1M074 [details] [associations]
            symbol:F1M074 "Uncharacterized protein" species:10116
            "Rattus norvegicus" [GO:0003677 "DNA binding" evidence=IEA]
            InterPro:IPR001025 Pfam:PF01426 PROSITE:PS51038 SMART:SM00439
            GO:GO:0003677 IPI:IPI00764269 Ensembl:ENSRNOT00000054977
            Uniprot:F1M074
        Length = 1123

 Score = 130 (50.8 bits), Expect = 0.00046, P = 0.00046
 Identities = 28/82 (34%), Positives = 43/82 (52%)

Query:    71 ISVGDCALF-KPPQDSPPFIGIIRSLTAGKENKLTLSVNWLYRPAEVKLGKGFLLEAAPN 129
             + +GDCA+F    + + P+IG I SL     + + + V W Y P E KLGK        N
Sbjct:   999 LRIGDCAVFLSAGRPNLPYIGRIESLWESWGSNMVVKVKWFYHPEETKLGKR--QSDGKN 1056

Query:   130 EIFYSFHKDEIPAASLLHPCKV 151
              ++ S H+DE    ++ H C+V
Sbjct:  1057 ALYQSCHEDENDVQTISHKCQV 1078


>UNIPROTKB|E9PTW3 [details] [associations]
            symbol:Dspp "Dentin sialophosphoprotein" species:10116
            "Rattus norvegicus" [GO:0071460 "cellular response to cell-matrix
            adhesion" evidence=IEA] RGD:2525 GeneTree:ENSGT00550000074777
            GO:GO:0071460 IPI:IPI00198253 Ensembl:ENSRNOT00000002946
            OMA:DESSIAV ArrayExpress:E9PTW3 Uniprot:E9PTW3
        Length = 952

 Score = 129 (50.5 bits), Expect = 0.00049, P = 0.00049
 Identities = 83/455 (18%), Positives = 166/455 (36%)

Query:   657 QNSSGGSVEDTSVTNSRASSPVLPEKQNQFDR-NFKEKNDALRADISSNMNSEPWQSNVN 715
             ++ S     DT  ++S        E +++ D  N    +D+     SS+ +S+   S+ +
Sbjct:   495 EDDSSDDTSDTDDSDSNGDDDSESEDKDESDNSNHDNDSDSESKSDSSDSDSDSSDSSDS 554

Query:   716 KDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNSLPPGYEFKDVKLHESSFSSM 775
              D++   D    S +       S + D+      D  D+S     +  D   ++SS SS 
Sbjct:   555 SDSSDSSDSSDSSESS-DSSDSSDSSDSSDS--SDSSDSS--DSSDSSDSDSNDSSDSSD 609

Query:   776 NALIESCVKYSEANVSAPAGD--DIGMNLLASVAAGEMSKSDVVSPVGSPPRTPIHEPL- 832
             ++        SE++ S+ + D  D   +  +S ++     SD      S   +   +   
Sbjct:   610 SSDSSDSSDSSESSDSSDSSDSSDSSESSESSESSDSSDSSDSSDSSDSSESSESSDSSD 669

Query:   833 ---CDDNDSRVKSFPGDHS--TDSTDDEHEKQGIDRN--LWAKNSDSNQDKPAGGLTGHI 885
                  D+DS   S   D S  +DS+D        D +    + +SDS+    +       
Sbjct:   670 SSDSSDSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSDSSDSDSSDSSDSSDSSD 729

Query:   886 STSPVDLQQSGDPCQENTENSKEIIVAEETPDGAGRNPEEDKAGFRVDADGAPDGKQRIS 945
             S+   D   S D    ++ +S +   + ++ D +  +   D +     +D +       S
Sbjct:   730 SSDSSDSSDSSDSSNSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDS 789

Query:   946 GPLSTEDKVSESTRGVET-EAVEGSASNQSLEFDGENKKGVSEGLNSGVKREQKPSPITT 1004
                S     S+S+   ++ ++ + S S+ S +    N    S+  +S    +   S  ++
Sbjct:   790 SDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSSDSNSSDSSDSSDSSDSSDSSDSSDSS 849

Query:  1005 HS-ESVKGKDGELLHTSGSGEDMPLXXXXXXXXXXXXXXXSKSHVNQTEEQNSEWKSNAP 1063
              S +S    D      S    D                  S S  + + + +    S++ 
Sbjct:   850 DSSDSSDSSDSSNSSDSSDSSDGDSSDGDSSDSDSSDSDSSNSSDSDSSDSSDSSSSDSS 909

Query:  1064 MIREDRVVPHLGSAENEEK-GNGKVDHRENLEGKE 1097
                 D       S++++ K GNG  D   + EG +
Sbjct:   910 DSDSDSKDSTSDSSDDDSKSGNGNSDSDSDSEGSD 944


>FB|FBgn0010422 [details] [associations]
            symbol:TfIIS "RNA polymerase II elongation factor"
            species:7227 "Drosophila melanogaster" [GO:0006355 "regulation of
            transcription, DNA-dependent" evidence=ISS] [GO:0005634 "nucleus"
            evidence=ISS;NAS] [GO:0006368 "transcription elongation from RNA
            polymerase II promoter" evidence=NAS] [GO:0008270 "zinc ion
            binding" evidence=IEA] [GO:0003677 "DNA binding" evidence=IEA]
            [GO:0006357 "regulation of transcription from RNA polymerase II
            promoter" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
            transcription, elongation" evidence=IEA] InterPro:IPR001222
            InterPro:IPR003617 InterPro:IPR003618 InterPro:IPR006289
            InterPro:IPR016492 InterPro:IPR017890 InterPro:IPR017923
            Pfam:PF01096 Pfam:PF07500 Pfam:PF08711 PIRSF:PIRSF006704
            PROSITE:PS00466 PROSITE:PS51133 PROSITE:PS51319 PROSITE:PS51321
            SMART:SM00440 SMART:SM00509 SMART:SM00510 GO:GO:0005634
            EMBL:AE014134 GO:GO:0046872 GO:GO:0003677 GO:GO:0008270
            GO:GO:0006357 GO:GO:0006351 Gene3D:1.10.472.30 SUPFAM:SSF46942
            Gene3D:1.20.930.10 SUPFAM:SSF47676 GO:GO:0032784 eggNOG:COG1594
            KO:K03145 TIGRFAMs:TIGR01385 OMA:LEIWKKV
            GeneTree:ENSGT00390000017794 EMBL:X53670 EMBL:L26091 EMBL:AY051843
            PIR:S55899 RefSeq:NP_476967.1 UniGene:Dm.3045
            ProteinModelPortal:P20232 SMR:P20232 DIP:DIP-23586N
            MINT:MINT-924728 STRING:P20232 PaxDb:P20232 PRIDE:P20232
            EnsemblMetazoa:FBtr0080722 EnsemblMetazoa:FBtr0332475 GeneID:34883
            KEGG:dme:Dmel_CG3710 CTD:34883 FlyBase:FBgn0010422
            InParanoid:P20232 OrthoDB:EOG4GXD3M PhylomeDB:P20232
            GenomeRNAi:34883 NextBio:790684 Bgee:P20232 GermOnline:CG3710
            Uniprot:P20232
        Length = 313

 Score = 122 (48.0 bits), Expect = 0.00051, P = 0.00051
 Identities = 41/139 (29%), Positives = 66/139 (47%)

Query:   374 DKSVEEFLLILLRALDKLPVNLNALQMCNIGKSVNHLR-THKNVEIQKKARSLVDTWKKR 432
             D + ++  L LL+AL  L +NL+ L    IG +VN LR + K+ E+   A++L+  WK+ 
Sbjct:    20 DGTGQDQALDLLKALQTLNINLDILTKTRIGMTVNELRKSSKDDEVIALAKTLIKNWKRF 79

Query:   433 VEAEMDARPRLPEVPHSGNRQTGASTEV-AIKSL-VTQPASSKTGAVKLCQGDAPTKSAF 490
             +     A P  P  P++ + + G+S    A KS    + +SS +G  K     +      
Sbjct:    80 L-----ASPA-PTTPNNSSAKEGSSNNSSASKSTSAAKSSSSISGKDKSSSSSSSKDKEK 133

Query:   491 SSPVSVKSAPLPASGSTDA 509
                 S      P+ G TDA
Sbjct:   134 KGSTSSSQTSFPSGGMTDA 152


>POMBASE|SPBC215.13 [details] [associations]
            symbol:SPBC215.13 "sequence orphan" species:4896
            "Schizosaccharomyces pombe" [GO:0003674 "molecular_function"
            evidence=ND] [GO:0005783 "endoplasmic reticulum" evidence=IDA]
            [GO:0005886 "plasma membrane" evidence=IEA] [GO:0008150
            "biological_process" evidence=ND] [GO:0009986 "cell surface"
            evidence=NAS] [GO:0031225 "anchored to membrane" evidence=IEA]
            PomBase:SPBC215.13 GO:GO:0005783 GO:GO:0005886 GO:GO:0009986
            GO:GO:0031225 EMBL:CU329671 PIR:T39903 RefSeq:NP_596690.1
            ProteinModelPortal:O94317 EnsemblFungi:SPBC215.13.1 GeneID:2540669
            KEGG:spo:SPBC215.13 eggNOG:opiNOG43420 OMA:NPTDVTS NextBio:20801793
            Uniprot:O94317
        Length = 534

 Score = 124 (48.7 bits), Expect = 0.00078, P = 0.00078
 Identities = 76/367 (20%), Positives = 133/367 (36%)

Query:   471 SSKTGAVKLCQGDA-PTKSA----FS---SPVSVKSAPLPASGSTDAKDGQPRNAASATG 522
             +++TG      G A PT S+    FS   +P    S   P S  +D    Q  ++ S   
Sbjct:    63 AAETGGTPTYYGYATPTSSSEPSIFSESATPSETNSYSSPVSSYSDPATSQLPSSTSFFS 122

Query:   523 TTDLPSTPAKDEKXXXXXXXXXXXXXCVGDHAKTGGLSGKEDARSSATVSMTLNKISVGS 582
              T    TP+  E                   +    +S    + S    S T + +S  +
Sbjct:   123 PTSSEYTPSSTESSSLLDPSSVSSAILPSSTSVEVSISSSSLSSSDPLTSSTFSSLSSST 182

Query:   583 SRSRKSVNGYPSSTPAGVQRETVSSK---NATLHRNSASDRPSQPSLTCEKALD---VPV 636
             S S+ SV+   SST +     + SS    ++++  +S+S   S  S     +L    +P 
Sbjct:   183 SSSQPSVSSTSSSTFSSAAPTSTSSSYLSSSSVVSSSSSPSSSSSSTLTSSSLSTSSIPS 242

Query:   637 VEGANPKIIVKIPNRGRSPAQNSSGGSVEDTSVTNSRASSPVLPEKQNQFDRNFKEKNDA 696
                ++      + +   S   +SS  S    S ++S +SSP           +      +
Sbjct:   243 TSSSSSSTSSSLSSSSSSSTASSSSSSSSIISSSSSSSSSPTSTSSTISSSSSSSSSPTS 302

Query:   697 LRADISSNMNSEP-WQSNVNKDAAACPDEGSGSPAVLPDEQGSKTGDNCRKVVEDLEDNS 755
               + ISS+ +S   + S ++  + +     S SP        S +             +S
Sbjct:   303 TSSTISSSSSSSSSFSSTLSSSSMSSSSSFSSSPTSSSSTISSSSSSPSSSSFSSTTSSS 362

Query:   756 LPPGYEFKDVKLHESSFSSMNALIESCVKYSEANVSAPAGDDIGMNLLASVAAGEMSKSD 815
                    K      S+ SS ++   S +  S ++ S PA      + L+S  +   SKS 
Sbjct:   363 -------KSSSSFSSTVSSSSSTSSSTLTSSSSSSSRPASSSSHSSSLSSHKSSSSSKSS 415

Query:   816 VVSPVGS 822
               +PV S
Sbjct:   416 S-APVSS 421


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.310   0.129   0.377    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0     1646      1501   0.00098  124 3  11 23  0.37    35
                                                     39  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  24
  No. of states in DFA:  630 (67 KB)
  Total size of DFA:  607 KB (2269 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:00
  No. of threads or processors used:  24
  Search cpu time:  152.27u 0.12s 152.39t   Elapsed:  00:00:09
  Total cpu time:  152.29u 0.12s 152.41t   Elapsed:  00:00:09
  Start:  Sat May 11 01:26:18 2013   End:  Sat May 11 01:26:27 2013

Back to top