BLAST Results

Query Summary

Your job contains 1 sequence.

Parameters
Threshold: 0.001
Maximum number of alignments shown: 100
BLAST filter: on

Query Sequence

>psy10508
MLDSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIEN
YISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN
NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE
MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA
AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK
SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP
ELKRPLTSITDEDKKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAI
QQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVK
EKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEVMSVEYCYQ
RNWHKQAVTSFIIGI

High Scoring Gene Products

Symbol, full name Information P value
Hpr1 protein from Drosophila melanogaster 7.1e-109
THOC1
Uncharacterized protein
protein from Canis lupus familiaris 1.4e-80
THOC1
Uncharacterized protein
protein from Bos taurus 1.4e-80
THOC1
Uncharacterized protein
protein from Gallus gallus 1.8e-80
THOC1
THO complex subunit 1
protein from Homo sapiens 4.7e-80
Thoc1
THO complex 1
protein from Mus musculus 6.0e-80
THOC1
Uncharacterized protein
protein from Sus scrofa 8.7e-79
thoc1
THO complex 1
gene_product from Danio rerio 3.0e-78
THOC1
Uncharacterized protein
protein from Canis lupus familiaris 4.7e-71
Thoc1
THO complex subunit 1
protein from Rattus norvegicus 3.2e-68
thoc-1 gene from Caenorhabditis elegans 1.1e-65
THO1
AT5G09860
protein from Arabidopsis thaliana 3.9e-60
thoc1
putative THO1 protein (nuclear matrix protein p84)
gene from Dictyostelium discoideum 2.0e-38
Thoc1
THO complex 1
gene from Rattus norvegicus 4.7e-20

The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.

Back to top

Raw Blast Data

BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]

Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.

Reference:  Gish, W. (1996-2006) http://blast.wustl.edu

Query=  psy10508
        (555 letters)

Database:  go_20130330-seqdb.fasta
           368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done

                                                                     Smallest
                                                                       Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N

FB|FBgn0037382 - symbol:Hpr1 "Hpr1" species:7227 "Drosoph...   648  7.1e-109  2
UNIPROTKB|E2RNV0 - symbol:THOC1 "Uncharacterized protein"...   809  1.4e-80   1
UNIPROTKB|F1MJV3 - symbol:THOC1 "Uncharacterized protein"...   809  1.4e-80   1
UNIPROTKB|F1NMW7 - symbol:THOC1 "Uncharacterized protein"...   808  1.8e-80   1
UNIPROTKB|Q96FV9 - symbol:THOC1 "THO complex subunit 1" s...   804  4.7e-80   1
UNIPROTKB|D4ABL0 - symbol:Thoc1 "THO complex subunit 1" s...   803  6.0e-80   1
MGI|MGI:1919668 - symbol:Thoc1 "THO complex 1" species:10...   803  6.0e-80   1
UNIPROTKB|I3LE05 - symbol:THOC1 "Uncharacterized protein"...   792  8.7e-79   1
ZFIN|ZDB-GENE-030826-9 - symbol:thoc1 "THO complex 1" spe...   787  3.0e-78   1
UNIPROTKB|J9NUJ3 - symbol:THOC1 "Uncharacterized protein"...   413  4.7e-71   3
UNIPROTKB|Q6TUH4 - symbol:Thoc1 "LRRGT00070" species:1011...   387  3.2e-68   3
WB|WBGene00020172 - symbol:thoc-1 species:6239 "Caenorhab...   380  1.1e-65   2
TAIR|locus:2178183 - symbol:THO1 "AT5G09860" species:3702...   616  3.9e-60   1
DICTYBASE|DDB_G0275717 - symbol:thoc1 "putative THO1 prot...   233  2.0e-38   3
RGD|1308657 - symbol:Thoc1 "THO complex 1" species:10116 ...   250  4.7e-20   1
POMBASE|SPCP25A2.03 - symbol:SPCP25A2.03 "THO complex sub...   262  2.2e-19   1


>FB|FBgn0037382 [details] [associations]
            symbol:Hpr1 "Hpr1" species:7227 "Drosophila melanogaster"
            [GO:0005654 "nucleoplasm" evidence=ISS] [GO:0007165 "signal
            transduction" evidence=IEA] [GO:0006406 "mRNA export from nucleus"
            evidence=NAS] [GO:0031990 "mRNA export from nucleus in response to
            heat stress" evidence=IMP] [GO:0000347 "THO complex" evidence=IDA]
            [GO:0043234 "protein complex" evidence=IPI] [GO:0005634 "nucleus"
            evidence=IDA] InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017
            EMBL:AE014297 GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029
            SUPFAM:SSF47986 GO:GO:0031990 InterPro:IPR021861 Pfam:PF11957
            GO:GO:0000347 eggNOG:NOG275387 KO:K12878
            GeneTree:ENSGT00390000016232 EMBL:AY122188 EMBL:AJ556821
            RefSeq:NP_649594.1 UniGene:Dm.31292 SMR:Q9VNI8 DIP:DIP-48907N
            STRING:Q9VNI8 EnsemblMetazoa:FBtr0078667 GeneID:40723
            KEGG:dme:Dmel_CG2031 UCSC:CG2031-RA CTD:40723 FlyBase:FBgn0037382
            InParanoid:Q9VNI8 OMA:DNLQACK OrthoDB:EOG4T1G2K GenomeRNAi:40723
            NextBio:820267 Uniprot:Q9VNI8
        Length = 701

 Score = 648 (233.2 bits), Expect = 7.1e-109, Sum P(2) = 7.1e-109
 Identities = 120/252 (47%), Positives = 172/252 (68%)

Query:     4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
             + +L     ++ ++  +Y  +    ++DK+  ++ + R  L+K +   D D+  I   + 
Sbjct:    28 ALELAITDGKVELLVKEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84

Query:    64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
             L VE    ++ ++T+P++LL DTFD+ TLDKC+++F +VE  V +WK++ FF SCKNN+L
Sbjct:    85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEDMVEVWKEEIFFSSCKNNIL 144

Query:   124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
             RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +   
Sbjct:   145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204

Query:   184 SSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
              S+              ID++ Y KFWSLQD+FRNP QCYNK  WKMF  +A+ +L +F 
Sbjct:   205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHADNILQSFS 264

Query:   244 SYKLDDVQSSLN 255
             S+KL+DV+ S N
Sbjct:   265 SFKLEDVRQSSN 276

 Score = 448 (162.8 bits), Expect = 7.1e-109, Sum P(2) = 7.1e-109
 Identities = 106/264 (40%), Positives = 150/264 (56%)

Query:   252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
             SS+  +  +FAK+LTN KLL LQLSD NFRR VL+QFLILFQY   +VK + +   L +D
Sbjct:   299 SSVIKANHFFAKFLTNPKLLALQLSDANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTAD 358

Query:   312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP----LT 367
             Q  ++K+T   VY L+++TPP G+ FS+ V  +L  EE WN WKNEGC E K+P    L+
Sbjct:   359 QADFIKETESRVYKLLEETPPYGKRFSRTVYHMLAREEMWNNWKNEGCKEFKKPEEPTLS 418

Query:   368 SIXXXXX---------------XXXXXXXXXXXXXLTKLWN-SKDNLEACKSAERDFTPS 411
                                                LT+LWN S DNL+ACKS +R+F P 
Sbjct:   419 EEDSKPTPNKRPRRPLGDALRDASRSGKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPL 478

Query:   412 LESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFI 471
             LE+Y E   +++DPA              + WRALRLL+R+ PHFF + +    K S+++
Sbjct:   479 LETYLETPHEKVDPA--------------FEWRALRLLARQTPHFFTSLSQPSSKISDYL 524

Query:   472 ENMVKRCVKEK---PSSQISGNGN 492
             E + KR +++K   P++ +S N +
Sbjct:   525 EQVRKRLIRDKEPKPAALLSNNSS 548


>UNIPROTKB|E2RNV0 [details] [associations]
            symbol:THOC1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0046784 "intronless viral mRNA export from
            host nucleus" evidence=IEA] [GO:0032784 "regulation of
            DNA-dependent transcription, elongation" evidence=IEA] [GO:0006915
            "apoptotic process" evidence=IEA] [GO:0005737 "cytoplasm"
            evidence=IEA] [GO:0000445 "THO complex part of transcription export
            complex" evidence=IEA] [GO:0007165 "signal transduction"
            evidence=IEA] InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017
            SMART:SM00005 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165
            Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0046784
            GO:GO:0032784 GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957
            CTD:9984 KO:K12878 OMA:ILMGNEE GeneTree:ENSGT00390000016232
            EMBL:AAEX03005467 RefSeq:XP_547651.2 Ensembl:ENSCAFT00000029114
            GeneID:490529 KEGG:cfa:490529 Uniprot:E2RNV0
        Length = 657

 Score = 809 (289.8 bits), Expect = 1.4e-80, P = 1.4e-80
 Identities = 165/344 (47%), Positives = 210/344 (61%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct:   207 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386

Query:   350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
             +WN WKNEGCP   +  TS                                LT+LWN   
Sbjct:   387 NWNSWKNEGCPSFVKERTSDTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCP 446

Query:   396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
             DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PH
Sbjct:   447 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPH 506

Query:   456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             FF       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 118/243 (48%), Positives = 150/243 (61%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK  L+Q+ R  L + I        V+   ISL +    + +C ++ P +LL D
Sbjct:    41 PGSENEKKCTLDQAFRGVLEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 99

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV       
Sbjct:   160 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD 
Sbjct:   220 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279

Query:   251 QSS 253
             Q+S
Sbjct:   280 QAS 282


>UNIPROTKB|F1MJV3 [details] [associations]
            symbol:THOC1 "Uncharacterized protein" species:9913 "Bos
            taurus" [GO:0046784 "intronless viral mRNA export from host
            nucleus" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
            transcription, elongation" evidence=IEA] [GO:0006915 "apoptotic
            process" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0000445 "THO complex part of transcription export complex"
            evidence=IEA] [GO:0007165 "signal transduction" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 Gene3D:1.10.533.10
            InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0046784 GO:GO:0032784
            GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 OMA:ILMGNEE
            GeneTree:ENSGT00390000016232 EMBL:DAAA02056653 IPI:IPI00699918
            Ensembl:ENSBTAT00000025584 Uniprot:F1MJV3
        Length = 660

 Score = 809 (289.8 bits), Expect = 1.4e-80, P = 1.4e-80
 Identities = 165/344 (47%), Positives = 210/344 (61%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct:   210 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 269

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   270 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 329

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   330 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 389

Query:   350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
             +WN WKNEGCP   +  TS                                LT+LWN   
Sbjct:   390 NWNSWKNEGCPSFVKERTSDSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCP 449

Query:   396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
             DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PH
Sbjct:   450 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPH 509

Query:   456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             FF       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   510 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 550

 Score = 566 (204.3 bits), Expect = 7.8e-55, P = 7.8e-55
 Identities = 118/243 (48%), Positives = 150/243 (61%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK  L+Q+ R  L + I        V+   ISL +    + +C ++ P +LL D
Sbjct:    44 PGSENEKKCTLDQAFRGVLEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 102

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   103 VLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 162

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV       
Sbjct:   163 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 222

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD 
Sbjct:   223 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 282

Query:   251 QSS 253
             Q+S
Sbjct:   283 QAS 285


>UNIPROTKB|F1NMW7 [details] [associations]
            symbol:THOC1 "Uncharacterized protein" species:9031 "Gallus
            gallus" [GO:0007165 "signal transduction" evidence=IEA] [GO:0000445
            "THO complex part of transcription export complex" evidence=IEA]
            [GO:0005737 "cytoplasm" evidence=IEA] [GO:0006915 "apoptotic
            process" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
            transcription, elongation" evidence=IEA] [GO:0046784 "intronless
            viral mRNA export from host nucleus" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 GO:GO:0006406
            Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0032784
            GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 OMA:ILMGNEE
            GeneTree:ENSGT00390000016232 EMBL:AADN02021331 EMBL:AADN02077728
            IPI:IPI00585456 Ensembl:ENSGALT00000024055 Uniprot:F1NMW7
        Length = 547

 Score = 808 (289.5 bits), Expect = 1.8e-80, P = 1.8e-80
 Identities = 168/344 (48%), Positives = 210/344 (61%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             DE MDV                 ID+N Y+KFWSLQDYFRNPVQCY KVSWK F  Y+E 
Sbjct:    96 DEGMDVEEGEMGDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEE 155

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   156 VLAVFKSYKLDDTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 215

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT+ VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   216 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEE 275

Query:   350 HWNQWKNEGCPEL--KRPLTSIXXXXXXXXXX-----------XXXXXXXXLTKLWN-SK 395
             +WN WKNEGCP    +RP  S                              LT+LWN   
Sbjct:   276 NWNSWKNEGCPSFVKERPPDSKPMRPARKRPAPEDFLGKGPNKKILMGNEELTRLWNLCP 335

Query:   396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
             DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PH
Sbjct:   336 DNMEACKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPH 395

Query:   456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             FF       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   396 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 436

 Score = 451 (163.8 bits), Expect = 1.2e-42, P = 1.2e-42
 Identities = 93/158 (58%), Positives = 106/158 (67%)

Query:   111 QQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVE 170
             Q  F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+E
Sbjct:    14 QNMFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLE 73

Query:   171 NITEFG--------G-------DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDY 215
             N+T F         G       DE MDV                 ID+N Y+KFWSLQDY
Sbjct:    74 NVTVFNTNEHESTLGQKHSEERDEGMDVEEGEMGDDEAPTSCSIPIDYNLYRKFWSLQDY 133

Query:   216 FRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS 253
             FRNPVQCY KVSWK F  Y+E VLA FKSYKLDD Q+S
Sbjct:   134 FRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLDDTQAS 171


>UNIPROTKB|Q96FV9 [details] [associations]
            symbol:THOC1 "THO complex subunit 1" species:9606 "Homo
            sapiens" [GO:0007165 "signal transduction" evidence=IEA]
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
            [GO:0008380 "RNA splicing" evidence=IEA] [GO:0016363 "nuclear
            matrix" evidence=IEA] [GO:0016607 "nuclear speck" evidence=IEA]
            [GO:0000346 "transcription export complex" evidence=IDA]
            [GO:0000347 "THO complex" evidence=IDA] [GO:0000445 "THO complex
            part of transcription export complex" evidence=IDA] [GO:0005737
            "cytoplasm" evidence=IDA] [GO:0005634 "nucleus" evidence=IDA]
            [GO:0006915 "apoptotic process" evidence=IDA] [GO:0006406 "mRNA
            export from nucleus" evidence=IDA] [GO:0046784 "intronless viral
            mRNA export from host nucleus" evidence=IDA] [GO:0032784
            "regulation of DNA-dependent transcription, elongation"
            evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
            [GO:0006396 "RNA processing" evidence=TAS] [GO:0005730 "nucleolus"
            evidence=IDA] InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017
            SMART:SM00005 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165
            GO:GO:0008380 GO:GO:0003677 GO:GO:0016607 GO:GO:0006397
            GO:GO:0006351 GO:GO:0003723 GO:GO:0006396 EMBL:CH471113
            Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0016363
            GO:GO:0046784 GO:GO:0032784 GO:GO:0000445 InterPro:IPR021861
            Pfam:PF11957 EMBL:L36529 EMBL:AY573302 EMBL:AY573303 EMBL:AK314755
            EMBL:AP000845 EMBL:BC010381 IPI:IPI00305374 IPI:IPI00646620
            IPI:IPI00944519 PIR:A53545 RefSeq:NP_005122.2 UniGene:Hs.712543
            PDB:1WXP PDBsum:1WXP ProteinModelPortal:Q96FV9 SMR:Q96FV9
            IntAct:Q96FV9 MINT:MINT-4536762 STRING:Q96FV9 PhosphoSite:Q96FV9
            DMDM:37999906 PaxDb:Q96FV9 PRIDE:Q96FV9 DNASU:9984
            Ensembl:ENST00000261600 GeneID:9984 KEGG:hsa:9984 UCSC:uc002kkj.4
            UCSC:uc002kkl.2 CTD:9984 GeneCards:GC18M000204 HGNC:HGNC:19070
            HPA:HPA019096 HPA:HPA019687 MIM:606930 neXtProt:NX_Q96FV9
            PharmGKB:PA134887435 eggNOG:NOG275387 HOGENOM:HOG000008123
            HOVERGEN:HBG060294 InParanoid:Q96FV9 KO:K12878 OMA:ILMGNEE
            OrthoDB:EOG4HX50P ChiTaRS:THOC1 EvolutionaryTrace:Q96FV9
            GenomeRNAi:9984 NextBio:37696 Bgee:Q96FV9 CleanEx:HS_THOC1
            Genevestigator:Q96FV9 GermOnline:ENSG00000079134 Uniprot:Q96FV9
        Length = 657

 Score = 804 (288.1 bits), Expect = 4.7e-80, P = 4.7e-80
 Identities = 165/344 (47%), Positives = 209/344 (60%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct:   207 EEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386

Query:   350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
             +WN WKNEGCP   +  TS                                LT+LWN   
Sbjct:   387 NWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCP 446

Query:   396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
             DN+EACKS  R+  P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PH
Sbjct:   447 DNMEACKSETREHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPH 506

Query:   456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             FF       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 118/243 (48%), Positives = 150/243 (61%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK  L+Q+ R  L + I        V+   ISL +    + +C ++ P +LL D
Sbjct:    41 PGSENEKKCTLDQAFRGILEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 99

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV       
Sbjct:   160 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD 
Sbjct:   220 EEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279

Query:   251 QSS 253
             Q+S
Sbjct:   280 QAS 282


>UNIPROTKB|D4ABL0 [details] [associations]
            symbol:Thoc1 "THO complex subunit 1" species:10116 "Rattus
            norvegicus" [GO:0007165 "signal transduction" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            RGD:1308657 GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029
            SUPFAM:SSF47986 InterPro:IPR021861 Pfam:PF11957 OrthoDB:EOG4HX50P
            IPI:IPI00560890 Ensembl:ENSRNOT00000021087 ArrayExpress:D4ABL0
            Uniprot:D4ABL0
        Length = 657

 Score = 803 (287.7 bits), Expect = 6.0e-80, P = 6.0e-80
 Identities = 164/344 (47%), Positives = 209/344 (60%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct:   207 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386

Query:   350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
             +WN WKNEGCP   +   S                                LT+LWN   
Sbjct:   387 NWNSWKNEGCPSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCP 446

Query:   396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
             DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PH
Sbjct:   447 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPH 506

Query:   456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             FF       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547

 Score = 570 (205.7 bits), Expect = 2.9e-55, P = 2.9e-55
 Identities = 119/243 (48%), Positives = 150/243 (61%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK  L+Q+ R  L + I        V+   ISL +    + +C ++ P +LL D
Sbjct:    41 PGSENEKKCTLDQAFRGVLEEEIINHSACENVLA-IISLAIGGVTESVCTASTPFVLLGD 99

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN+ENIT F  +E+               MDV       
Sbjct:   160 QLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD 
Sbjct:   220 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279

Query:   251 QSS 253
             Q+S
Sbjct:   280 QAS 282


>MGI|MGI:1919668 [details] [associations]
            symbol:Thoc1 "THO complex 1" species:10090 "Mus musculus"
            [GO:0000346 "transcription export complex" evidence=ISO]
            [GO:0000347 "THO complex" evidence=ISO] [GO:0000445 "THO complex
            part of transcription export complex" evidence=ISO] [GO:0003677
            "DNA binding" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
            [GO:0005634 "nucleus" evidence=ISO] [GO:0005737 "cytoplasm"
            evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
            evidence=IEA] [GO:0006355 "regulation of transcription,
            DNA-dependent" evidence=IEA] [GO:0006397 "mRNA processing"
            evidence=IEA] [GO:0006406 "mRNA export from nucleus" evidence=ISO]
            [GO:0006810 "transport" evidence=IEA] [GO:0006915 "apoptotic
            process" evidence=ISO;RCA] [GO:0007165 "signal transduction"
            evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0032784
            "regulation of DNA-dependent transcription, elongation"
            evidence=ISO] [GO:0042981 "regulation of apoptotic process"
            evidence=RCA] [GO:0046784 "intronless viral mRNA export from host
            nucleus" evidence=ISO] [GO:0051028 "mRNA transport" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            MGI:MGI:1919668 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165
            GO:GO:0008380 GO:GO:0003677 GO:GO:0016607 GO:GO:0006397
            GO:GO:0006351 GO:GO:0003723 Gene3D:1.10.533.10 InterPro:IPR011029
            SUPFAM:SSF47986 GO:GO:0016363 GO:GO:0046784 GO:GO:0032784
            GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 CTD:9984
            eggNOG:NOG275387 HOGENOM:HOG000008123 HOVERGEN:HBG060294 KO:K12878
            OMA:ILMGNEE OrthoDB:EOG4HX50P EMBL:AK031785 EMBL:AK032200
            EMBL:AK042867 EMBL:BC024951 IPI:IPI00153778 RefSeq:NP_705780.1
            UniGene:Mm.219648 ProteinModelPortal:Q8R3N6 SMR:Q8R3N6
            STRING:Q8R3N6 PhosphoSite:Q8R3N6 PaxDb:Q8R3N6 PRIDE:Q8R3N6
            Ensembl:ENSMUST00000025137 GeneID:225160 KEGG:mmu:225160
            UCSC:uc008eal.1 GeneTree:ENSGT00390000016232 InParanoid:Q8R3N6
            NextBio:377554 Bgee:Q8R3N6 CleanEx:MM_THOC1 Genevestigator:Q8R3N6
            GermOnline:ENSMUSG00000024287 Uniprot:Q8R3N6
        Length = 657

 Score = 803 (287.7 bits), Expect = 6.0e-80, P = 6.0e-80
 Identities = 164/344 (47%), Positives = 209/344 (60%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct:   207 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386

Query:   350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
             +WN WKNEGCP   +   S                                LT+LWN   
Sbjct:   387 NWNSWKNEGCPSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCP 446

Query:   396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
             DN+EACKS  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PH
Sbjct:   447 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPH 506

Query:   456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             FF       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547

 Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
 Identities = 118/243 (48%), Positives = 150/243 (61%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK  L+Q+ R  L + I        V+   ISL +    + +C ++ P +LL D
Sbjct:    41 PGSENEKKCTLDQAFRGVLEEEIINHSACENVLA-IISLAIGGVTESVCTASTPFVLLGD 99

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV       
Sbjct:   160 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD 
Sbjct:   220 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279

Query:   251 QSS 253
             Q+S
Sbjct:   280 QAS 282


>UNIPROTKB|I3LE05 [details] [associations]
            symbol:THOC1 "Uncharacterized protein" species:9823 "Sus
            scrofa" [GO:0046784 "intronless viral mRNA export from host
            nucleus" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
            transcription, elongation" evidence=IEA] [GO:0006915 "apoptotic
            process" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
            [GO:0000445 "THO complex part of transcription export complex"
            evidence=IEA] [GO:0007165 "signal transduction" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 Gene3D:1.10.533.10
            InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0046784 GO:GO:0032784
            GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 OMA:ILMGNEE
            GeneTree:ENSGT00390000016232 Ensembl:ENSSSCT00000029403
            Uniprot:I3LE05
        Length = 662

 Score = 792 (283.9 bits), Expect = 8.7e-79, P = 8.7e-79
 Identities = 160/330 (48%), Positives = 203/330 (61%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E 
Sbjct:   210 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 269

Query:   238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
             VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct:   270 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 329

Query:   290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
             ILFQY    VK +     L  +Q  W++DTT++VY L+ + PPDGE FS++V+ IL  EE
Sbjct:   330 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 389

Query:   350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
             +WN WKNEGCP   +  TS                                LT+LWN   
Sbjct:   390 NWNSWKNEGCPSFVKERTSDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCP 449

Query:   396 DNLEACKSAER--DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
             DN+EACKS  R  ++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ 
Sbjct:   450 DNMEACKSETRPREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRS 509

Query:   454 PHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
             PHFF       +   E++ENMV +  KE P
Sbjct:   510 PHFFQPTNQQFKSLPEYLENMVIKLAKELP 539

 Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
 Identities = 118/243 (48%), Positives = 150/243 (61%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK  L+Q+ R  L + I        V+   ISL +    + +C ++ P +LL D
Sbjct:    44 PGSENEKKCTLDQAFRGVLEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 102

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+ +F +VE NV  WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   103 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 162

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN+EN+T F  +E+               MDV       
Sbjct:   163 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 222

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFWSLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD 
Sbjct:   223 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 282

Query:   251 QSS 253
             Q+S
Sbjct:   283 QAS 285


>ZFIN|ZDB-GENE-030826-9 [details] [associations]
            symbol:thoc1 "THO complex 1" species:7955 "Danio
            rerio" [GO:0007165 "signal transduction" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            ZFIN:ZDB-GENE-030826-9 GO:GO:0007165 Gene3D:1.10.533.10
            InterPro:IPR011029 SUPFAM:SSF47986 InterPro:IPR021861 Pfam:PF11957
            CTD:9984 HOGENOM:HOG000008123 HOVERGEN:HBG060294 KO:K12878
            EMBL:BC054938 IPI:IPI00491037 RefSeq:NP_958481.1 UniGene:Dr.75966
            ProteinModelPortal:Q7SYB2 SMR:Q7SYB2 PRIDE:Q7SYB2 GeneID:373077
            KEGG:dre:373077 InParanoid:Q7SYB2 NextBio:20813350
            ArrayExpress:Q7SYB2 Bgee:Q7SYB2 Uniprot:Q7SYB2
        Length = 655

 Score = 787 (282.1 bits), Expect = 3.0e-78, P = 3.0e-78
 Identities = 159/329 (48%), Positives = 205/329 (62%)

Query:   178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
             +E MDV                 ID+N Y+KFW+LQDYFRNPVQCY+K SW  F  Y++ 
Sbjct:   207 EEGMDVEEGEMGDEDAPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFIKYSDE 266

Query:   238 VLAAFKSYKLDDVQSSLNP-------SGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
              LA FKS+KLDD+Q+S          SGD  YFAK+LT++KL+DLQLSD+NFRR++LLQ+
Sbjct:   267 ALAVFKSFKLDDMQASKKKLEEMRTSSGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQY 326

Query:   289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
             LILFQY    VK +     L  DQ  W++DTT+ VY L+K+ PPDG+ F  +V+ IL  E
Sbjct:   327 LILFQYLKGQVKFKSSSCVLNDDQSLWIEDTTKLVYQLLKEIPPDGDKFGSMVEHILNTE 386

Query:   349 EHWNQWKNEGCPEL--KRPLTS--IXXXXXXXX---------XXXXXXXXXXLTKLWN-S 394
             E+WN WKNEGCP    +RP  +  I                           LT+LWN +
Sbjct:   387 ENWNSWKNEGCPSFVKERPAETKPIRPSRKRQAPEDFLGKGPDRKILMGNDELTRLWNLN 446

Query:   395 KDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCP 454
              DN+EACKS  R+F PSLE +FEEAI+Q DPA  VE++YK V +SNY WRALRLLSR+ P
Sbjct:   447 PDNMEACKSENREFMPSLEDFFEEAIEQADPANMVEDEYKVVRNSNYGWRALRLLSRRSP 506

Query:   455 HFFLNATPNVEKNSEFIENMVKRCVKEKP 483
             HFF       +  ++++ENMV +  KE P
Sbjct:   507 HFFQPTNQQFKSLADYLENMVIKLAKELP 535

 Score = 564 (203.6 bits), Expect = 1.3e-54, P = 1.3e-54
 Identities = 111/243 (45%), Positives = 156/243 (64%)

Query:    26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
             P  + +KK+ L+Q++R  L + I    +++    + I + ++   + +C++T P +LL D
Sbjct:    40 PGNETEKKATLDQALRGVLEEQIVNQKVNVDDFLSLIYISIDGVTEGICSATTPFLLLGD 99

Query:    86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
               D   LD+C+++F +VE NV+ WK  TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct:   100 VLDCLPLDQCDKIFSFVEENVSTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159

Query:   146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
              LFLA+ FP SE+SGLN+ S+FN++NIT F  +E+               MDV       
Sbjct:   160 QLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQDSTLGQQHTEVKEEGMDVEEGEMGD 219

Query:   191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
                       ID+N Y+KFW+LQDYFRNPVQCY+K SW  F  Y++  LA FKS+KLDD+
Sbjct:   220 EDAPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFIKYSDEALAVFKSFKLDDM 279

Query:   251 QSS 253
             Q+S
Sbjct:   280 QAS 282


>UNIPROTKB|J9NUJ3 [details] [associations]
            symbol:THOC1 "Uncharacterized protein" species:9615 "Canis
            lupus familiaris" [GO:0007165 "signal transduction" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986
            InterPro:IPR021861 Pfam:PF11957 GeneTree:ENSGT00390000016232
            EMBL:AAEX03005467 Ensembl:ENSCAFT00000045552 Uniprot:J9NUJ3
        Length = 499

 Score = 413 (150.4 bits), Expect = 4.7e-71, Sum P(3) = 4.7e-71
 Identities = 85/158 (53%), Positives = 105/158 (66%)

Query:   159 SGLNIISEFNVENITEFGGDEE--------MDVSSNXXXXXXXXXXXXXXIDFNFYKKFW 210
             S LN+ S+FN+EN+T F  +E+        MDV                 ID+N Y+KFW
Sbjct:    93 SSLNLQSQFNLENVTVFNTNEQHTEDREEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFW 152

Query:   211 SLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFA 262
             SLQDYFRNPVQCY K+SWK F  Y+E VLA FKSYKLDD Q+S      L   G+  YFA
Sbjct:   153 SLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFA 212

Query:   263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
             K+LT++KL+DLQLSD+NFRR++LLQ+LILFQY    VK
Sbjct:   213 KFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 250

 Score = 228 (85.3 bits), Expect = 4.7e-71, Sum P(3) = 4.7e-71
 Identities = 46/94 (48%), Positives = 61/94 (64%)

Query:   406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
             R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHFF       +
Sbjct:   299 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 358

Query:   466 KNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
                E++ENMV +  KE   PS +I     G D+D
Sbjct:   359 SLPEYLENMVIKLAKELPPPSEEIK---TGEDED 389

 Score = 109 (43.4 bits), Expect = 4.7e-71, Sum P(3) = 4.7e-71
 Identities = 20/53 (37%), Positives = 30/53 (56%)

Query:    62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
             ISL +    + +C ++ P +LL D  D   LD+C+ +F +VE NV  WK   F
Sbjct:    31 ISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNLF 83


>UNIPROTKB|Q6TUH4 [details] [associations]
            symbol:Thoc1 "LRRGT00070" species:10116 "Rattus norvegicus"
            [GO:0007165 "signal transduction" evidence=IEA] InterPro:IPR000488
            Pfam:PF00531 PROSITE:PS50017 SMART:SM00005 RGD:1308657
            GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986
            InterPro:IPR021861 Pfam:PF11957 CTD:9984 eggNOG:NOG275387
            HOGENOM:HOG000008123 HOVERGEN:HBG060294 KO:K12878
            GeneTree:ENSGT00390000016232 EMBL:AY387056 IPI:IPI00421329
            RefSeq:NP_001041315.1 UniGene:Rn.202648 SMR:Q6TUH4 STRING:Q6TUH4
            Ensembl:ENSRNOT00000046718 GeneID:291797 KEGG:rno:291797
            NextBio:633190 Genevestigator:Q6TUH4 Uniprot:Q6TUH4
        Length = 499

 Score = 387 (141.3 bits), Expect = 3.2e-68, Sum P(3) = 3.2e-68
 Identities = 77/148 (52%), Positives = 98/148 (66%)

Query:   161 LNIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPV 220
             L +++E+ ++       +E MDV                 ID+N Y+KFWSLQDYFRNPV
Sbjct:   103 LGLVAEYGLDPQHTEDREEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPV 162

Query:   221 QCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLD 272
             QCY K+SWK F  Y+E VLA FKSYKLDD Q+S      L   G+  YFAK+LT++KL+D
Sbjct:   163 QCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMD 222

Query:   273 LQLSDTNFRRYVLLQFLILFQYFTSTVK 300
             LQLSD+NFRR++LLQ+LILFQY    VK
Sbjct:   223 LQLSDSNFRRHILLQYLILFQYLKGQVK 250

 Score = 231 (86.4 bits), Expect = 3.2e-68, Sum P(3) = 3.2e-68
 Identities = 48/103 (46%), Positives = 64/103 (62%)

Query:   397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
             NL    +  R++ P+LE +FEEAI+Q DP   VE +YK VN+SNY WRALRLL+R+ PHF
Sbjct:   290 NLTVYFTMTREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHF 349

Query:   457 FLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
             F       +   E++ENMV +  KE   PS +I     G D+D
Sbjct:   350 FQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 389

 Score = 105 (42.0 bits), Expect = 3.2e-68, Sum P(3) = 3.2e-68
 Identities = 19/49 (38%), Positives = 29/49 (59%)

Query:    62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWK 110
             ISL +    + +C ++ P +LL D  D   LD+C+ +F +VE NV  WK
Sbjct:    31 ISLAIGGVTESVCTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWK 79


>WB|WBGene00020172 [details] [associations]
            symbol:thoc-1 species:6239 "Caenorhabditis elegans"
            [GO:0000347 "THO complex" evidence=ISS] InterPro:IPR021861
            Pfam:PF11957 eggNOG:NOG275387 KO:K12878
            GeneTree:ENSGT00390000016232 EMBL:FO081698 RefSeq:NP_493796.2
            UniGene:Cel.14485 ProteinModelPortal:Q9N5E3 SMR:Q9N5E3
            STRING:Q9N5E3 PaxDb:Q9N5E3 EnsemblMetazoa:T02H6.2 GeneID:173460
            KEGG:cel:CELE_T02H6.2 UCSC:T02H6.2 CTD:173460 WormBase:T02H6.2
            InParanoid:Q9N5E3 OMA:VEENMNE NextBio:879757 Uniprot:Q9N5E3
        Length = 665

 Score = 380 (138.8 bits), Expect = 1.1e-65, Sum P(2) = 1.1e-65
 Identities = 89/232 (38%), Positives = 119/232 (51%)

Query:   257 SGD-YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT--STVKSRGEGLELKSDQE 313
             S D +F KYLT+ KLL LQL+D++FRRY L+Q +I+FQY T  S  K   + + L  DQ 
Sbjct:   285 SNDVFFTKYLTSPKLLALQLNDSSFRRYFLMQAIIIFQYLTAESRFKPPAKKMVLNEDQA 344

Query:   314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSIXXXX 373
             K+V +  +  Y L+  T P G  F   +K I+  E+ WN WKN  C +            
Sbjct:   345 KYVSECEDKCYRLLADTMPRGTAFVAGLKRIMLREQEWNTWKNANCADFSEKADKGAMQM 404

Query:   374 XXXXXX------XXXXXXXXLTKLW-NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPA 426
                                 LTKLW N  D L+ACKS +R F P L  +  + I +MDP 
Sbjct:   405 YKKRQRIPFNPNSLDLGTPELTKLWTNEPDVLKACKSDKRKFIPKLPDFIRDPIDEMDPE 464

Query:   427 AAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNS---EFIE-NM 474
               VEEQYK++NDS + WRA RLL  K P +        +  +   EF+E NM
Sbjct:   465 QQVEEQYKQINDSAFQWRAARLLMHKSPGYVTKTDTKTDPTTNIKEFLERNM 516

 Score = 319 (117.4 bits), Expect = 1.1e-65, Sum P(2) = 1.1e-65
 Identities = 70/202 (34%), Positives = 104/202 (51%)

Query:    69 CMK-DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCN 127
             C K  +C+   P+  L D  +MS++++C+Q+F  VE N+N +KQ  F  + +NN+LR CN
Sbjct:    67 CSKLGLCSKNTPLSTLQDLLEMSSIEECKQIFSIVEENMNEFKQPGFIETAQNNILRFCN 126

Query:   128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNX 187
             DLLRRLSR+  T FCGRI+ FL++F P +E+SG+N +  FN  N+T +  + E D  +  
Sbjct:   127 DLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNVTNYD-ESETDGEALL 185

Query:   188 XXXXXXXXXXXXXIDFN-----------------FYKKFWSLQDYFRNPVQCYNKVSWKM 230
                           D                    Y++FWSLQ +  NP   Y K  +  
Sbjct:   186 AATSSAPTPTEGAEDMETGEIEEDNSKEIQVTPEMYRQFWSLQKFMSNPNSIYEKEKFLT 245

Query:   231 FTSYAETVLAAFKSYKLDDVQS 252
             F +    VL    S KL+ + S
Sbjct:   246 FKTDLTAVLTLMTSNKLEKLSS 267

 Score = 39 (18.8 bits), Expect = 7.9e-26, Sum P(2) = 7.9e-26
 Identities = 14/49 (28%), Positives = 23/49 (46%)

Query:   393 NSKDNLEA-CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
             N K+ LE    +A ++F    ES      ++ +    VEE  K+  DS+
Sbjct:   507 NIKEFLERNMYNAAKNFNEFKESIENREKKEAEARKKVEESLKRKLDSS 555


>TAIR|locus:2178183 [details] [associations]
            symbol:THO1 "AT5G09860" species:3702 "Arabidopsis
            thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0010267
            "production of ta-siRNAs involved in RNA interference"
            evidence=IMP] [GO:0031047 "gene silencing by RNA" evidence=IMP]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0006406 "mRNA export from
            nucleus" evidence=IMP] [GO:0050832 "defense response to fungus"
            evidence=IMP] GO:GO:0005634 EMBL:CP002688 GO:GO:0050832
            GO:GO:0006406 GO:GO:0010267 InterPro:IPR021861 Pfam:PF11957
            KO:K12878 EMBL:AF424566 EMBL:AF428323 IPI:IPI00528228
            RefSeq:NP_568219.1 UniGene:At.27041 PRIDE:Q93VM9
            EnsemblPlants:AT5G09860.1 GeneID:830846 KEGG:ath:AT5G09860
            TAIR:At5g09860 InParanoid:Q93VM9 OMA:DMDPSAG PhylomeDB:Q93VM9
            ProtClustDB:CLSN2689570 Genevestigator:Q93VM9 Uniprot:Q93VM9
        Length = 599

 Score = 616 (221.9 bits), Expect = 3.9e-60, P = 3.9e-60
 Identities = 164/494 (33%), Positives = 246/494 (49%)

Query:    38 QSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDK 94
             + I QY  +LI   D D  +   I + + + + LC K+     +   LL D  +MST+  
Sbjct:    62 EQIMQYG-QLIDDDDDDDDIHGQIPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKN 120

Query:    95 CEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
             C+ +F Y+E   +I  +Q  F   K  +LR CN LLRRLS++ + VFCGRIL+FLA FFP
Sbjct:   121 CKDVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFP 180

Query:   155 FSERSGLNIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQD 214
              SERS +NI   FN  N T++  D    +S                +DFNFYK FWSLQ+
Sbjct:   181 LSERSAVNIKGVFNTSNETKYEKDPPKGIS----------------VDFNFYKTFWSLQE 224

Query:   215 YFRNPVQCYN-KVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQK 269
             YF NP    +    W+ F+S    VL  F +  L + +   N     +  +  KYLT+ K
Sbjct:   225 YFCNPASLTSASTKWQKFSSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSK 284

Query:   270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQ 329
             L+ L+L D++FRR++LLQ LI+F Y  +  K+  + L  ++ +E+ +K   + V  L++ 
Sbjct:   285 LMGLELKDSSFRRHILLQCLIMFDYLRAPGKN-DKDLPSETMKEE-LKSCEDRVKKLLEI 342

Query:   330 TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSIXXXXXXXXXXXX-XXXXXX 387
             TPP G+ F + V+ IL+ E++W  WK +GCP   K+P+                      
Sbjct:   343 TPPKGKEFLRAVEHILEREKNWVWWKRDGCPPFEKQPIDKKSPNAGQKKRRQRWRLGNKE 402

Query:   388 LTKLWNSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRAL 446
             L++LW   D N  A   ++R  TP +  Y++   + MDP+A +E++Y   N+  Y W+ L
Sbjct:   403 LSQLWRWADQNPNALTDSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGL 462

Query:   447 RLLSRKCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEKP---SSQIS 488
             R  +R+    F   T              P V    +   N   KR  KE+    S +  
Sbjct:   463 RFTARQDLEGFSRFTEMGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETE 522

Query:   489 GNGNGVDQDPAEVE 502
             GN  GV    AE E
Sbjct:   523 GNQIGVSNSEAEAE 536


>DICTYBASE|DDB_G0275717 [details] [associations]
            symbol:thoc1 "putative THO1 protein (nuclear matrix
            protein p84)" species:44689 "Dictyostelium discoideum" [GO:0008150
            "biological_process" evidence=ND] [GO:0005575 "cellular_component"
            evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
            dictyBase:DDB_G0275717 GenomeReviews:CM000151_GR EMBL:AAFI02000013
            InterPro:IPR021861 Pfam:PF11957 eggNOG:NOG275387 KO:K12878
            RefSeq:XP_643594.2 STRING:Q552T7 EnsemblProtists:DDB0233560
            GeneID:8620181 KEGG:ddi:DDB_G0275717 OMA:SFSTHIN
            ProtClustDB:CLSZ2848579 Uniprot:Q552T7
        Length = 726

 Score = 233 (87.1 bits), Expect = 2.0e-38, Sum P(3) = 2.0e-38
 Identities = 76/267 (28%), Positives = 126/267 (47%)

Query:   224 NKVSWKMFTSYAETVLAAFKSY-KLDDVQ--SSLNPSGD-YFAKYLTNQKLLDLQLSDTN 279
             NK+ W+ F    E V+ +F ++  LD++   SS NPS   YF KYLT+  L+ LQL D+ 
Sbjct:   321 NKIKWESFIQSLELVIGSFSTHINLDELSQSSSNNPSKKHYFTKYLTSSNLMKLQLKDSI 380

Query:   280 FRRYVLLQFLILFQYFTSTVKSRGEGLELKSD-QEKWVKDTTETVYSLIKQTPPDGEHFS 338
             FR+ +L Q LI FQ    T +       + +D Q+  +++ T   + ++  T P+GE+FS
Sbjct:   381 FRKNILTQILITFQALDLTNQKYPT---IFNDLQKNIIQELTNKCFKILSNTNPNGEYFS 437

Query:   339 QVVKLILKGEEHWNQWKNEG-CPELKRP----LTSIXXXXXXXXXXXXXXXXXXLTKLWN 393
               +  ILK E++W  WK +  C   +RP    +                     L++LWN
Sbjct:   438 NCLSSILKREKNWIIWKRDNQCKPFERPPCSPIVKKKKLFRKTALTKISLGNQELSRLWN 497

Query:   394 SKDNLEACKSAERDFTPSLESYFE----EAIQQMDPAA--AVEEQYKKVNDSNY--AWRA 445
                        +   + SL+S+ E    E I+Q +     A++ + +K N+     A +A
Sbjct:   498 LSGAPNDRSYLKTQNSVSLDSFIEPLKKETIEQEEKTKQEALKLERRKKNEQKRSDAEKA 557

Query:   446 LRLLSRKCPHFFLNATPNVEKNSEFIE 472
              R    +    FL A P  +K+ ++ E
Sbjct:   558 RRKEYDEKKAEFLLANPT-KKDKDYQE 583

 Score = 215 (80.7 bits), Expect = 2.0e-38, Sum P(3) = 2.0e-38
 Identities = 63/186 (33%), Positives = 93/186 (50%)

Query:     6 QLNFDK-SQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISL 64
             +L F+K S L  +   Y   K     + K++++ +IR +   LIK  DI  + I+  I L
Sbjct:    66 KLLFNKDSLLKEIQKIYPNIKDPIQIEIKTSIDLNIRIFFNNLIKQIDITYENIDLAIKL 125

Query:    65 CVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK--NNL 122
                     + +S LP+ L  D F+  T+ KC  LF  +E    I+ Q    +  +  N L
Sbjct:   126 AYSFVELGILDSILPLQLSEDLFETKTISKCLDLFGLLESRAEIFSQDPEIIKGRKRNLL 185

Query:   123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENITEFGGDEE 180
             L++C +LL+R     N   CGRILLFLA  FP S+ SGLN   E N+  E   +F  D  
Sbjct:   186 LKICIELLKR---ETNPDSCGRILLFLAYVFPLSDPSGLNTKGEHNIHPEEALDFQNDIM 242

Query:   181 MDVSSN 186
              +V+ N
Sbjct:   243 NNVNGN 248

 Score = 85 (35.0 bits), Expect = 2.0e-38, Sum P(3) = 2.0e-38
 Identities = 24/103 (23%), Positives = 39/103 (37%)

Query:   162 NIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXX---XXIDFNFYKKFWSLQDYFRN 218
             +I++  N  N       E+   +S                  +D NFY++FW LQ  F+N
Sbjct:   240 DIMNNVNGNNNNNVNNSEDTTTTSTAATATVITTTNGNNDTTVDRNFYRQFWGLQTVFQN 299

Query:   219 PVQCY----------------NKVSWKMFTSYAETVLAAFKSY 245
             P Q                  NK+ W+ F    E V+ +F ++
Sbjct:   300 PQQVLLNTTVTTGGTITNITLNKIKWESFIQSLELVIGSFSTH 342

 Score = 59 (25.8 bits), Expect = 3.4e-20, Sum P(3) = 3.4e-20
 Identities = 17/59 (28%), Positives = 31/59 (52%)

Query:   424 DPAAAVEEQYKKVNDSN-YAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE 481
             D  + ++E  + + D+  Y W+ LRL+SRK    F N  P   K  + I++ +K  + +
Sbjct:   594 DDLSDLDEPKELLRDNPVYIWKTLRLISRKRLELFKN--P---KFDDIIQSFIKPTITQ 647

 Score = 47 (21.6 bits), Expect = 2.4e-14, Sum P(2) = 2.4e-14
 Identities = 11/31 (35%), Positives = 16/31 (51%)

Query:   228 WKMFTSYAETVLAAFKSYKLDDV-QSSLNPS 257
             WK     +   L  FK+ K DD+ QS + P+
Sbjct:   614 WKTLRLISRKRLELFKNPKFDDIIQSFIKPT 644

 Score = 45 (20.9 bits), Expect = 9.2e-19, Sum P(3) = 9.2e-19
 Identities = 10/29 (34%), Positives = 15/29 (51%)

Query:   449 LSRKCPHFFLNATPNVEKNSEFIENMVKR 477
             L+ KC     N  PN E  S  + +++KR
Sbjct:   418 LTNKCFKILSNTNPNGEYFSNCLSSILKR 446


>RGD|1308657 [details] [associations]
            symbol:Thoc1 "THO complex 1" species:10116 "Rattus norvegicus"
            [GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
            evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0006351
            "transcription, DNA-dependent" evidence=IEA] [GO:0006355
            "regulation of transcription, DNA-dependent" evidence=IEA]
            [GO:0006397 "mRNA processing" evidence=IEA] [GO:0006915 "apoptotic
            process" evidence=IEA] [GO:0007165 "signal transduction"
            evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0016363
            "nuclear matrix" evidence=IEA] [GO:0016607 "nuclear speck"
            evidence=IEA] [GO:0051028 "mRNA transport" evidence=IEA]
            InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
            RGD:1308657 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 GO:GO:0006355
            GO:GO:0008380 GO:GO:0003677 GO:GO:0016607 GO:GO:0006397
            GO:GO:0006351 GO:GO:0003723 Gene3D:1.10.533.10 InterPro:IPR011029
            SUPFAM:SSF47986 GO:GO:0016363 GO:GO:0051028 InterPro:IPR021861
            Pfam:PF11957 eggNOG:NOG79897 GeneTree:ENSGT00390000016232
            EMBL:AY325254 IPI:IPI00382382 UniGene:Rn.127881
            ProteinModelPortal:P59924 SMR:P59924 PRIDE:P59924
            Ensembl:ENSRNOT00000045976 UCSC:RGD:1308657 HOGENOM:HOG000202278
            HOVERGEN:HBG079252 InParanoid:P59924 Genevestigator:P59924
            GermOnline:ENSRNOG00000032739 InterPro:IPR013544 Pfam:PF08333
            Uniprot:P59924
        Length = 343

 Score = 250 (93.1 bits), Expect = 4.7e-20, P = 4.7e-20
 Identities = 51/113 (45%), Positives = 69/113 (61%)

Query:   388 LTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRAL 446
             LT+LWN   DN+EACK   R++ P LE +FEEAI+Q D    VE +YK +N+SNY W  L
Sbjct:   116 LTRLWNLCPDNMEACKLETREYMPILEEFFEEAIEQADAENMVESEYKAINNSNYGWSTL 175

Query:   447 RLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP--SSQISGNGNGVDQD 497
             R L+ + PHFF       +  +E++ENMV +  KE P  S +I     G D+D
Sbjct:   176 RFLAWRSPHFFQPTNQQFKNMTEYLENMVIKLAKELPPHSEEIK---TGEDED 225


>POMBASE|SPCP25A2.03 [details] [associations]
            symbol:SPCP25A2.03 "THO complex subunit (predicted)"
            species:4896 "Schizosaccharomyces pombe" [GO:0000347 "THO complex"
            evidence=ISS] [GO:0003674 "molecular_function" evidence=ND]
            [GO:0005634 "nucleus" evidence=IDA] [GO:0006368 "transcription
            elongation from RNA polymerase II promoter" evidence=ISS]
            [GO:0006406 "mRNA export from nucleus" evidence=IC]
            PomBase:SPCP25A2.03 EMBL:CU329672 GO:GO:0006368 GO:GO:0006406
            InterPro:IPR021861 Pfam:PF11957 GO:GO:0000347 eggNOG:NOG275387
            KO:K12878 PIR:T50450 RefSeq:NP_588092.1 STRING:Q9URT2
            EnsemblFungi:SPCP25A2.03.1 GeneID:2539396 KEGG:spo:SPCP25A2.03
            OMA:REANWIR OrthoDB:EOG4X3M8G NextBio:20800560 Uniprot:Q9URT2
        Length = 752

 Score = 262 (97.3 bits), Expect = 2.2e-19, P = 2.2e-19
 Identities = 94/333 (28%), Positives = 145/333 (43%)

Query:    28 CDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
             C Y+     E  + + L  L     +D+ VI N I+       +  C+  LP ++L +  
Sbjct:    60 CCYETARKSEIGLEERLKCLFAI--LDLLVIGNEIN-------ESFCDHLLPFLILEELM 110

Query:    88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
             D+ T+++C +L+ Y E   ++ K           LLR+ N+LLRRLSR +N+ FCGRI +
Sbjct:   111 DIHTVNECAKLYEYFETRPSLMKGIVSNRGRGPVLLRISNELLRRLSRQENSSFCGRIDI 170

Query:   148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXXXXIDFNFYK 207
              L+K FP  ERSG N+  ++N   +  FG  E    S+                 F  Y 
Sbjct:   171 LLSKAFPPEERSGANLRGDYNT--VHSFGKVELSPPSTPISDRTDLSYHKKLNTLFTAY- 227

Query:   208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF-----------KSYKLDDVQSSLNP 256
               W LQ    NP +     +   F   A + + AF           KS    D  SS   
Sbjct:   228 --WDLQCMCSNPPKLLASDTLPKFIDAAGSAIQAFESILQNTFFNGKSNPTIDPNSSSLL 285

Query:   257 SGDYF-------AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELK 309
             S  Y        +KY+ ++ L + QLSD +FR   +LQ +I+F +     K R E   L 
Sbjct:   286 SEKYITLDKGFPSKYIYSRSLFEYQLSDEDFRLQAILQLIIIFDFLLDHSKERIERRTL- 344

Query:   310 SDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
                EKW   T + V  ++  +  D    +++ K
Sbjct:   345 ---EKW---TNKAVIPIVILSDEDTSKLNELSK 371

 Score = 144 (55.7 bits), Expect = 2.1e-06, P = 2.1e-06
 Identities = 56/216 (25%), Positives = 90/216 (41%)

Query:   262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV----- 316
             +KY+ ++ L + QLSD +FR   +LQ +I+F +     K R E   L+    K V     
Sbjct:   298 SKYIYSRSLFEYQLSDEDFRLQAILQLIIIFDFLLDHSKERIERRTLEKWTNKAVIPIVI 357

Query:   317 ---KDTTET------VYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT 367
                +DT++        YS +  T   G    + +K I+  E +W  WK  GCP L++PL 
Sbjct:   358 LSDEDTSKLNELSKEAYSFL-HTARCGS-VQRTIKEIIHIEGNWKLWKGLGCPSLEKPLV 415

Query:   368 S----------IXXXXXXXXXXXXXXXXXXLTKLWNS--KDNLEACKSAERDFTPSLESY 415
                        +                  L++LW    ++ L+  K  ER   PS ES+
Sbjct:   416 DKAAIDEAVEGLKKLTNTPVKLRFAMGNAALSRLWEQAGENTLDDLKKEERYRIPSPESF 475

Query:   416 FEEA-IQQMDPAAAVEEQYKKVNDSNYA---WRALR 447
                    + +   AV +  K  ++ + A   WRA R
Sbjct:   476 LSGVKADKFEIEEAVRDDDKHFHEQSLATKTWRAFR 511


Parameters:
  V=100
  filter=SEG
  E=0.001

  ctxfactor=1.00

  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.319   0.134   0.401    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a

  Query
  Frame  MatID  Length  Eff.Length     E     S W   T  X   E2     S2
   +0      0      555       504   0.00085  119 3  11 22  0.39    34
                                                     35  0.45    37


Statistics:

  Database:  /share/blast/go-seqdb.fasta
   Title:  go_20130330-seqdb.fasta
   Posted:  5:47:42 AM PDT Apr 1, 2013
   Created:  5:47:42 AM PDT Apr 1, 2013
   Format:  XDF-1
   # of letters in database:  169,044,731
   # of sequences in database:  368,745
   # of database sequences satisfying E:  16
  No. of states in DFA:  624 (66 KB)
  Total size of DFA:  337 KB (2168 KB)
  Time to generate neighborhood:  0.00u 0.00s 0.00t   Elapsed:  00:00:01
  No. of threads or processors used:  24
  Search cpu time:  41.62u 0.09s 41.71t   Elapsed:  00:00:09
  Total cpu time:  41.62u 0.09s 41.71t   Elapsed:  00:00:10
  Start:  Thu Aug 15 11:27:30 2013   End:  Thu Aug 15 11:27:40 2013

Back to top