Your job contains 1 sequence.
>psy10508
MLDSFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIEN
YISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKN
NLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE
MDVSSNETEETDTEEVDKVKIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLA
AFKSYKLDDVQSSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK
SRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCP
ELKRPLTSITDEDKKDEPDAKKKKTPELTKLWNSKDNLEACKSAERDFTPSLESYFEEAI
QQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVK
EKPSSQISGNGNGVDQDPAEVEVDTKSEEIQEEEKEEDWEAKADPEGDADEVMSVEYCYQ
RNWHKQAVTSFIIGI
The BLAST search returned 1 gene product which did not match your query constraints. Please see the full BLAST report below for the details.
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= psy10508
(555 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                    Smallest
                                                                    Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N
FB|FBgn0037382 - symbol:Hpr1 "Hpr1" species:7227 "Drosoph...   648  7.1e-109  2
UNIPROTKB|E2RNV0 - symbol:THOC1 "Uncharacterized protein"...   809  1.4e-80   1
UNIPROTKB|F1MJV3 - symbol:THOC1 "Uncharacterized protein"...   809  1.4e-80   1
UNIPROTKB|F1NMW7 - symbol:THOC1 "Uncharacterized protein"...   808  1.8e-80   1
UNIPROTKB|Q96FV9 - symbol:THOC1 "THO complex subunit 1" s...   804  4.7e-80   1
UNIPROTKB|D4ABL0 - symbol:Thoc1 "THO complex subunit 1" s...   803  6.0e-80   1
MGI|MGI:1919668 - symbol:Thoc1 "THO complex 1" species:10...   803  6.0e-80   1
UNIPROTKB|I3LE05 - symbol:THOC1 "Uncharacterized protein"...   792  8.7e-79   1
ZFIN|ZDB-GENE-030826-9 - symbol:thoc1 "THO complex 1" spe...   787  3.0e-78   1
UNIPROTKB|J9NUJ3 - symbol:THOC1 "Uncharacterized protein"...   413  4.7e-71   3
UNIPROTKB|Q6TUH4 - symbol:Thoc1 "LRRGT00070" species:1011...   387  3.2e-68   3
WB|WBGene00020172 - symbol:thoc-1 species:6239 "Caenorhab...   380  1.1e-65   2
TAIR|locus:2178183 - symbol:THO1 "AT5G09860" species:3702...   616  3.9e-60   1
DICTYBASE|DDB_G0275717 - symbol:thoc1 "putative THO1 prot...   233  2.0e-38   3
RGD|1308657 - symbol:Thoc1 "THO complex 1" species:10116 ...   250  4.7e-20   1
POMBASE|SPCP25A2.03 - symbol:SPCP25A2.03 "THO complex sub...   262  2.2e-19   1
>FB|FBgn0037382
symbol:Hpr1 "Hpr1" species:7227 "Drosophila melanogaster"
[GO:0005654 "nucleoplasm" evidence=ISS] [GO:0007165 "signal
transduction" evidence=IEA] [GO:0006406 "mRNA export from nucleus"
evidence=NAS] [GO:0031990 "mRNA export from nucleus in response to
heat stress" evidence=IMP] [GO:0000347 "THO complex" evidence=IDA]
[GO:0043234 "protein complex" evidence=IPI] [GO:0005634 "nucleus"
evidence=IDA] InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017
EMBL:AE014297 GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029
SUPFAM:SSF47986 GO:GO:0031990 InterPro:IPR021861 Pfam:PF11957
GO:GO:0000347 eggNOG:NOG275387 KO:K12878
GeneTree:ENSGT00390000016232 EMBL:AY122188 EMBL:AJ556821
RefSeq:NP_649594.1 UniGene:Dm.31292 SMR:Q9VNI8 DIP:DIP-48907N
STRING:Q9VNI8 EnsemblMetazoa:FBtr0078667 GeneID:40723
KEGG:dme:Dmel_CG2031 UCSC:CG2031-RA CTD:40723 FlyBase:FBgn0037382
InParanoid:Q9VNI8 OMA:DNLQACK OrthoDB:EOG4T1G2K GenomeRNAi:40723
NextBio:820267 Uniprot:Q9VNI8
Length = 701
Score = 648 (233.2 bits), Expect = 7.1e-109, Sum P(2) = 7.1e-109
Identities = 120/252 (47%), Positives = 172/252 (68%)
Query: 4 SFQLNFDKSQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYIS 63
+ +L ++ ++ +Y + ++DK+ ++ + R L+K + D D+ I +
Sbjct: 28 ALELAITDGKVELLVKEYNRFPANTEHDKRLPMDHAFRVLLMKRL---DEDVSRIGELVR 84
Query: 64 LCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLL 123
L VE ++ ++T+P++LL DTFD+ TLDKC+++F +VE V +WK++ FF SCKNN+L
Sbjct: 85 LSVEATRAEIVSNTIPVVLLIDTFDVVTLDKCQKIFQFVEDMVEVWKEEIFFSSCKNNIL 144
Query: 124 RMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDV 183
RMCNDLLRRLSR+QNTVFCGRI LFL+KFFPFSERSGLNI+SEFN++N TE+G D +
Sbjct: 145 RMCNDLLRRLSRTQNTVFCGRIQLFLSKFFPFSERSGLNIVSEFNLDNFTEYGLDSKDHD 204
Query: 184 SSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFK 243
S+ ID++ Y KFWSLQD+FRNP QCYNK WKMF +A+ +L +F
Sbjct: 205 ESDNKELEDTAEDIPLKIDYDLYCKFWSLQDFFRNPNQCYNKPQWKMFQMHADNILQSFS 264
Query: 244 SYKLDDVQSSLN 255
S+KL+DV+ S N
Sbjct: 265 SFKLEDVRQSSN 276
Score = 448 (162.8 bits), Expect = 7.1e-109, Sum P(2) = 7.1e-109
Identities = 106/264 (40%), Positives = 150/264 (56%)
Query: 252 SSLNPSGDYFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSD 311
SS+ + +FAK+LTN KLL LQLSD NFRR VL+QFLILFQY +VK + + L +D
Sbjct: 299 SSVIKANHFFAKFLTNPKLLALQLSDANFRRAVLVQFLILFQYLQVSVKFKSDTQTLTAD 358
Query: 312 QEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRP----LT 367
Q ++K+T VY L+++TPP G+ FS+ V +L EE WN WKNEGC E K+P L+
Sbjct: 359 QADFIKETESRVYKLLEETPPYGKRFSRTVYHMLAREEMWNNWKNEGCKEFKKPEEPTLS 418
Query: 368 SIXXXXX---------------XXXXXXXXXXXXXLTKLWN-SKDNLEACKSAERDFTPS 411
LT+LWN S DNL+ACKS +R+F P
Sbjct: 419 EEDSKPTPNKRPRRPLGDALRDASRSGKFYLGNDNLTRLWNYSPDNLQACKSEQRNFLPL 478
Query: 412 LESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNSEFI 471
LE+Y E +++DPA + WRALRLL+R+ PHFF + + K S+++
Sbjct: 479 LETYLETPHEKVDPA--------------FEWRALRLLARQTPHFFTSLSQPSSKISDYL 524
Query: 472 ENMVKRCVKEK---PSSQISGNGN 492
E + KR +++K P++ +S N +
Sbjct: 525 EQVRKRLIRDKEPKPAALLSNNSS 548
>UNIPROTKB|E2RNV0
symbol:THOC1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0046784 "intronless viral mRNA export from
host nucleus" evidence=IEA] [GO:0032784 "regulation of
DNA-dependent transcription, elongation" evidence=IEA] [GO:0006915
"apoptotic process" evidence=IEA] [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0000445 "THO complex part of transcription export
complex" evidence=IEA] [GO:0007165 "signal transduction"
evidence=IEA] InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017
SMART:SM00005 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165
Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0046784
GO:GO:0032784 GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957
CTD:9984 KO:K12878 OMA:ILMGNEE GeneTree:ENSGT00390000016232
EMBL:AAEX03005467 RefSeq:XP_547651.2 Ensembl:ENSCAFT00000029114
GeneID:490529 KEGG:cfa:490529 Uniprot:E2RNV0
Length = 657
Score = 809 (289.8 bits), Expect = 1.4e-80, P = 1.4e-80
Identities = 165/344 (47%), Positives = 210/344 (61%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 207 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386
Query: 350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
+WN WKNEGCP + TS LT+LWN
Sbjct: 387 NWNSWKNEGCPSFVKERTSDTKPTRVARKRTAPEDFLGKGPNKKILMGNEELTRLWNLCP 446
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PH
Sbjct: 447 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPH 506
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
FF + E++ENMV + KE PS +I G D+D
Sbjct: 507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547
Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
Identities = 118/243 (48%), Positives = 150/243 (61%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK L+Q+ R L + I V+ ISL + + +C ++ P +LL D
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 99
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV
Sbjct: 160 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD
Sbjct: 220 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279
Query: 251 QSS 253
Q+S
Sbjct: 280 QAS 282
>UNIPROTKB|F1MJV3
symbol:THOC1 "Uncharacterized protein" species:9913 "Bos
taurus" [GO:0046784 "intronless viral mRNA export from host
nucleus" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
transcription, elongation" evidence=IEA] [GO:0006915 "apoptotic
process" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0000445 "THO complex part of transcription export complex"
evidence=IEA] [GO:0007165 "signal transduction" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 Gene3D:1.10.533.10
InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0046784 GO:GO:0032784
GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 OMA:ILMGNEE
GeneTree:ENSGT00390000016232 EMBL:DAAA02056653 IPI:IPI00699918
Ensembl:ENSBTAT00000025584 Uniprot:F1MJV3
Length = 660
Score = 809 (289.8 bits), Expect = 1.4e-80, P = 1.4e-80
Identities = 165/344 (47%), Positives = 210/344 (61%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 210 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 269
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 270 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 329
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 330 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 389
Query: 350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
+WN WKNEGCP + TS LT+LWN
Sbjct: 390 NWNSWKNEGCPSFVKERTSDSKPTRAVRKRAAPEDFLGKGPSKKILMGNDELTRLWNLCP 449
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PH
Sbjct: 450 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPH 509
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
FF + E++ENMV + KE PS +I G D+D
Sbjct: 510 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 550
Score = 566 (204.3 bits), Expect = 7.8e-55, P = 7.8e-55
Identities = 118/243 (48%), Positives = 150/243 (61%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK L+Q+ R L + I V+ ISL + + +C ++ P +LL D
Sbjct: 44 PGSENEKKCTLDQAFRGVLEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 102
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 103 VLDCLPLDQCDTIFTFVERNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 162
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV
Sbjct: 163 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 222
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD
Sbjct: 223 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 282
Query: 251 QSS 253
Q+S
Sbjct: 283 QAS 285
>UNIPROTKB|F1NMW7
symbol:THOC1 "Uncharacterized protein" species:9031 "Gallus
gallus" [GO:0007165 "signal transduction" evidence=IEA] [GO:0000445
"THO complex part of transcription export complex" evidence=IEA]
[GO:0005737 "cytoplasm" evidence=IEA] [GO:0006915 "apoptotic
process" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
transcription, elongation" evidence=IEA] [GO:0046784 "intronless
viral mRNA export from host nucleus" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 GO:GO:0006406
Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0032784
GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 OMA:ILMGNEE
GeneTree:ENSGT00390000016232 EMBL:AADN02021331 EMBL:AADN02077728
IPI:IPI00585456 Ensembl:ENSGALT00000024055 Uniprot:F1NMW7
Length = 547
Score = 808 (289.5 bits), Expect = 1.8e-80, P = 1.8e-80
Identities = 168/344 (48%), Positives = 210/344 (61%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
DE MDV ID+N Y+KFWSLQDYFRNPVQCY KVSWK F Y+E
Sbjct: 96 DEGMDVEEGEMGDDEAPTSCSIPIDYNLYRKFWSLQDYFRNPVQCYEKVSWKTFLKYSEE 155
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 156 VLAVFKSYKLDDTQASRKKLEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 215
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT+ VY L+ + PPDGE FS++V+ IL EE
Sbjct: 216 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKAVYQLLSENPPDGERFSKMVEHILNTEE 275
Query: 350 HWNQWKNEGCPEL--KRPLTSIXXXXXXXXXX-----------XXXXXXXXLTKLWN-SK 395
+WN WKNEGCP +RP S LT+LWN
Sbjct: 276 NWNSWKNEGCPSFVKERPPDSKPMRPARKRPAPEDFLGKGPNKKILMGNEELTRLWNLCP 335
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PH
Sbjct: 336 DNMEACKSESREYMPTLEEFFEEAIEQADPENMVENKYKAVNNSNYGWRALRLLARRSPH 395
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
FF + E++ENMV + KE PS +I G D+D
Sbjct: 396 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 436
Score = 451 (163.8 bits), Expect = 1.2e-42, P = 1.2e-42
Identities = 93/158 (58%), Positives = 106/158 (67%)
Query: 111 QQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVE 170
Q F+ + KN LLRMCNDLLRRLS+SQNTVFCGRI LFLA+ FP SE+SGLN+ S+FN+E
Sbjct: 14 QNMFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRIQLFLARLFPLSEKSGLNLQSQFNLE 73
Query: 171 NITEFG--------G-------DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDY 215
N+T F G DE MDV ID+N Y+KFWSLQDY
Sbjct: 74 NVTVFNTNEHESTLGQKHSEERDEGMDVEEGEMGDDEAPTSCSIPIDYNLYRKFWSLQDY 133
Query: 216 FRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS 253
FRNPVQCY KVSWK F Y+E VLA FKSYKLDD Q+S
Sbjct: 134 FRNPVQCYEKVSWKTFLKYSEEVLAVFKSYKLDDTQAS 171
>UNIPROTKB|Q96FV9
symbol:THOC1 "THO complex subunit 1" species:9606 "Homo
sapiens" [GO:0007165 "signal transduction" evidence=IEA]
[GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
evidence=IEA] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0006397 "mRNA processing" evidence=IEA]
[GO:0008380 "RNA splicing" evidence=IEA] [GO:0016363 "nuclear
matrix" evidence=IEA] [GO:0016607 "nuclear speck" evidence=IEA]
[GO:0000346 "transcription export complex" evidence=IDA]
[GO:0000347 "THO complex" evidence=IDA] [GO:0000445 "THO complex
part of transcription export complex" evidence=IDA] [GO:0005737
"cytoplasm" evidence=IDA] [GO:0005634 "nucleus" evidence=IDA]
[GO:0006915 "apoptotic process" evidence=IDA] [GO:0006406 "mRNA
export from nucleus" evidence=IDA] [GO:0046784 "intronless viral
mRNA export from host nucleus" evidence=IDA] [GO:0032784
"regulation of DNA-dependent transcription, elongation"
evidence=IDA] [GO:0005515 "protein binding" evidence=IPI]
[GO:0006396 "RNA processing" evidence=TAS] [GO:0005730 "nucleolus"
evidence=IDA] InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017
SMART:SM00005 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165
GO:GO:0008380 GO:GO:0003677 GO:GO:0016607 GO:GO:0006397
GO:GO:0006351 GO:GO:0003723 GO:GO:0006396 EMBL:CH471113
Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0016363
GO:GO:0046784 GO:GO:0032784 GO:GO:0000445 InterPro:IPR021861
Pfam:PF11957 EMBL:L36529 EMBL:AY573302 EMBL:AY573303 EMBL:AK314755
EMBL:AP000845 EMBL:BC010381 IPI:IPI00305374 IPI:IPI00646620
IPI:IPI00944519 PIR:A53545 RefSeq:NP_005122.2 UniGene:Hs.712543
PDB:1WXP PDBsum:1WXP ProteinModelPortal:Q96FV9 SMR:Q96FV9
IntAct:Q96FV9 MINT:MINT-4536762 STRING:Q96FV9 PhosphoSite:Q96FV9
DMDM:37999906 PaxDb:Q96FV9 PRIDE:Q96FV9 DNASU:9984
Ensembl:ENST00000261600 GeneID:9984 KEGG:hsa:9984 UCSC:uc002kkj.4
UCSC:uc002kkl.2 CTD:9984 GeneCards:GC18M000204 HGNC:HGNC:19070
HPA:HPA019096 HPA:HPA019687 MIM:606930 neXtProt:NX_Q96FV9
PharmGKB:PA134887435 eggNOG:NOG275387 HOGENOM:HOG000008123
HOVERGEN:HBG060294 InParanoid:Q96FV9 KO:K12878 OMA:ILMGNEE
OrthoDB:EOG4HX50P ChiTaRS:THOC1 EvolutionaryTrace:Q96FV9
GenomeRNAi:9984 NextBio:37696 Bgee:Q96FV9 CleanEx:HS_THOC1
Genevestigator:Q96FV9 GermOnline:ENSG00000079134 Uniprot:Q96FV9
Length = 657
Score = 804 (288.1 bits), Expect = 4.7e-80, P = 4.7e-80
Identities = 165/344 (47%), Positives = 209/344 (60%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 207 EEGMDVEEGEMGDEEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386
Query: 350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
+WN WKNEGCP + TS LT+LWN
Sbjct: 387 NWNSWKNEGCPSFVKERTSDTKPTRIIRKRTAPEDFLGKGPTKKILMGNEELTRLWNLCP 446
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
DN+EACKS R+ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PH
Sbjct: 447 DNMEACKSETREHMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPH 506
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
FF + E++ENMV + KE PS +I G D+D
Sbjct: 507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547
Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
Identities = 118/243 (48%), Positives = 150/243 (61%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK L+Q+ R L + I V+ ISL + + +C ++ P +LL D
Sbjct: 41 PGSENEKKCTLDQAFRGILEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 99
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV
Sbjct: 160 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD
Sbjct: 220 EEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279
Query: 251 QSS 253
Q+S
Sbjct: 280 QAS 282
>UNIPROTKB|D4ABL0
symbol:Thoc1 "THO complex subunit 1" species:10116 "Rattus
norvegicus" [GO:0007165 "signal transduction" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
RGD:1308657 GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029
SUPFAM:SSF47986 InterPro:IPR021861 Pfam:PF11957 OrthoDB:EOG4HX50P
IPI:IPI00560890 Ensembl:ENSRNOT00000021087 ArrayExpress:D4ABL0
Uniprot:D4ABL0
Length = 657
Score = 803 (287.7 bits), Expect = 6.0e-80, P = 6.0e-80
Identities = 164/344 (47%), Positives = 209/344 (60%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 207 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386
Query: 350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
+WN WKNEGCP + S LT+LWN
Sbjct: 387 NWNSWKNEGCPSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCP 446
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PH
Sbjct: 447 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPH 506
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
FF + E++ENMV + KE PS +I G D+D
Sbjct: 507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547
Score = 570 (205.7 bits), Expect = 2.9e-55, P = 2.9e-55
Identities = 119/243 (48%), Positives = 150/243 (61%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK L+Q+ R L + I V+ ISL + + +C ++ P +LL D
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEIINHSACENVLA-IISLAIGGVTESVCTASTPFVLLGD 99
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN+ENIT F +E+ MDV
Sbjct: 160 QLFLARLFPLSEKSGLNLQSQFNLENITVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD
Sbjct: 220 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279
Query: 251 QSS 253
Q+S
Sbjct: 280 QAS 282
>MGI|MGI:1919668
symbol:Thoc1 "THO complex 1" species:10090 "Mus musculus"
[GO:0000346 "transcription export complex" evidence=ISO]
[GO:0000347 "THO complex" evidence=ISO] [GO:0000445 "THO complex
part of transcription export complex" evidence=ISO] [GO:0003677
"DNA binding" evidence=IEA] [GO:0003723 "RNA binding" evidence=IEA]
[GO:0005634 "nucleus" evidence=ISO] [GO:0005737 "cytoplasm"
evidence=ISO] [GO:0006351 "transcription, DNA-dependent"
evidence=IEA] [GO:0006355 "regulation of transcription,
DNA-dependent" evidence=IEA] [GO:0006397 "mRNA processing"
evidence=IEA] [GO:0006406 "mRNA export from nucleus" evidence=ISO]
[GO:0006810 "transport" evidence=IEA] [GO:0006915 "apoptotic
process" evidence=ISO;RCA] [GO:0007165 "signal transduction"
evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0032784
"regulation of DNA-dependent transcription, elongation"
evidence=ISO] [GO:0042981 "regulation of apoptotic process"
evidence=RCA] [GO:0046784 "intronless viral mRNA export from host
nucleus" evidence=ISO] [GO:0051028 "mRNA transport" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
MGI:MGI:1919668 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165
GO:GO:0008380 GO:GO:0003677 GO:GO:0016607 GO:GO:0006397
GO:GO:0006351 GO:GO:0003723 Gene3D:1.10.533.10 InterPro:IPR011029
SUPFAM:SSF47986 GO:GO:0016363 GO:GO:0046784 GO:GO:0032784
GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 CTD:9984
eggNOG:NOG275387 HOGENOM:HOG000008123 HOVERGEN:HBG060294 KO:K12878
OMA:ILMGNEE OrthoDB:EOG4HX50P EMBL:AK031785 EMBL:AK032200
EMBL:AK042867 EMBL:BC024951 IPI:IPI00153778 RefSeq:NP_705780.1
UniGene:Mm.219648 ProteinModelPortal:Q8R3N6 SMR:Q8R3N6
STRING:Q8R3N6 PhosphoSite:Q8R3N6 PaxDb:Q8R3N6 PRIDE:Q8R3N6
Ensembl:ENSMUST00000025137 GeneID:225160 KEGG:mmu:225160
UCSC:uc008eal.1 GeneTree:ENSGT00390000016232 InParanoid:Q8R3N6
NextBio:377554 Bgee:Q8R3N6 CleanEx:MM_THOC1 Genevestigator:Q8R3N6
GermOnline:ENSMUSG00000024287 Uniprot:Q8R3N6
Length = 657
Score = 803 (287.7 bits), Expect = 6.0e-80, P = 6.0e-80
Identities = 164/344 (47%), Positives = 209/344 (60%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 207 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 266
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 267 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 326
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 327 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 386
Query: 350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
+WN WKNEGCP + S LT+LWN
Sbjct: 387 NWNSWKNEGCPSFVKERASDTKPTRVVRKRAAPEDFLGKGPNKKILIGNEELTRLWNLCP 446
Query: 396 DNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPH 455
DN+EACKS R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PH
Sbjct: 447 DNMEACKSETREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPH 506
Query: 456 FFLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
FF + E++ENMV + KE PS +I G D+D
Sbjct: 507 FFQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 547
Score = 569 (205.4 bits), Expect = 3.7e-55, P = 3.7e-55
Identities = 118/243 (48%), Positives = 150/243 (61%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK L+Q+ R L + I V+ ISL + + +C ++ P +LL D
Sbjct: 41 PGSENEKKCTLDQAFRGVLEEEIINHSACENVLA-IISLAIGGVTESVCTASTPFVLLGD 99
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 100 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV
Sbjct: 160 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 219
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD
Sbjct: 220 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 279
Query: 251 QSS 253
Q+S
Sbjct: 280 QAS 282
>UNIPROTKB|I3LE05
symbol:THOC1 "Uncharacterized protein" species:9823 "Sus
scrofa" [GO:0046784 "intronless viral mRNA export from host
nucleus" evidence=IEA] [GO:0032784 "regulation of DNA-dependent
transcription, elongation" evidence=IEA] [GO:0006915 "apoptotic
process" evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA]
[GO:0000445 "THO complex part of transcription export complex"
evidence=IEA] [GO:0007165 "signal transduction" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 Gene3D:1.10.533.10
InterPro:IPR011029 SUPFAM:SSF47986 GO:GO:0046784 GO:GO:0032784
GO:GO:0000445 InterPro:IPR021861 Pfam:PF11957 OMA:ILMGNEE
GeneTree:ENSGT00390000016232 Ensembl:ENSSSCT00000029403
Uniprot:I3LE05
Length = 662
Score = 792 (283.9 bits), Expect = 8.7e-79, P = 8.7e-79
Identities = 160/330 (48%), Positives = 203/330 (61%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E
Sbjct: 210 EEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEE 269
Query: 238 VLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQFL 289
VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+DLQLSD+NFRR++LLQ+L
Sbjct: 270 VLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMDLQLSDSNFRRHILLQYL 329
Query: 290 ILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEE 349
ILFQY VK + L +Q W++DTT++VY L+ + PPDGE FS++V+ IL EE
Sbjct: 330 ILFQYLKGQVKFKSSNYVLTDEQSLWIEDTTKSVYQLLSENPPDGERFSKMVEHILNTEE 389
Query: 350 HWNQWKNEGCPELKRPLTSIXXXXXXXXXXXX-------------XXXXXXLTKLWN-SK 395
+WN WKNEGCP + TS LT+LWN
Sbjct: 390 NWNSWKNEGCPSFVKERTSDTKPTRVVRKRTAPEDFLGKGPNKKILMGNEELTRLWNLCP 449
Query: 396 DNLEACKSAER--DFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKC 453
DN+EACKS R ++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+
Sbjct: 450 DNMEACKSETRPREYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRS 509
Query: 454 PHFFLNATPNVEKNSEFIENMVKRCVKEKP 483
PHFF + E++ENMV + KE P
Sbjct: 510 PHFFQPTNQQFKSLPEYLENMVIKLAKELP 539
Score = 567 (204.7 bits), Expect = 6.1e-55, P = 6.1e-55
Identities = 118/243 (48%), Positives = 150/243 (61%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK L+Q+ R L + I V+ ISL + + +C ++ P +LL D
Sbjct: 44 PGSENEKKCTLDQAFRGVLEEEIINHSSCENVLA-IISLAIGGVTEGICTASTPFVLLGD 102
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+ +F +VE NV WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 103 VLDCLPLDQCDTIFTFVEKNVATWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 162
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN+EN+T F +E+ MDV
Sbjct: 163 QLFLARLFPLSEKSGLNLQSQFNLENVTVFNTNEQESTLGQKHTEDREEGMDVEEGEMGD 222
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFWSLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD
Sbjct: 223 DEAPTTCSIPIDYNLYRKFWSLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDT 282
Query: 251 QSS 253
Q+S
Sbjct: 283 QAS 285
>ZFIN|ZDB-GENE-030826-9
symbol:thoc1 "THO complex 1" species:7955 "Danio
rerio" [GO:0007165 "signal transduction" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
ZFIN:ZDB-GENE-030826-9 GO:GO:0007165 Gene3D:1.10.533.10
InterPro:IPR011029 SUPFAM:SSF47986 InterPro:IPR021861 Pfam:PF11957
CTD:9984 HOGENOM:HOG000008123 HOVERGEN:HBG060294 KO:K12878
EMBL:BC054938 IPI:IPI00491037 RefSeq:NP_958481.1 UniGene:Dr.75966
ProteinModelPortal:Q7SYB2 SMR:Q7SYB2 PRIDE:Q7SYB2 GeneID:373077
KEGG:dre:373077 InParanoid:Q7SYB2 NextBio:20813350
ArrayExpress:Q7SYB2 Bgee:Q7SYB2 Uniprot:Q7SYB2
Length = 655
Score = 787 (282.1 bits), Expect = 3.0e-78, P = 3.0e-78
Identities = 159/329 (48%), Positives = 205/329 (62%)
Query: 178 DEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAET 237
+E MDV ID+N Y+KFW+LQDYFRNPVQCY+K SW F Y++
Sbjct: 207 EEGMDVEEGEMGDEDAPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFIKYSDE 266
Query: 238 VLAAFKSYKLDDVQSSLNP-------SGD--YFAKYLTNQKLLDLQLSDTNFRRYVLLQF 288
LA FKS+KLDD+Q+S SGD YFAK+LT++KL+DLQLSD+NFRR++LLQ+
Sbjct: 267 ALAVFKSFKLDDMQASKKKLEEMRTSSGDHVYFAKFLTSEKLMDLQLSDSNFRRHILLQY 326
Query: 289 LILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGE 348
LILFQY VK + L DQ W++DTT+ VY L+K+ PPDG+ F +V+ IL E
Sbjct: 327 LILFQYLKGQVKFKSSSCVLNDDQSLWIEDTTKLVYQLLKEIPPDGDKFGSMVEHILNTE 386
Query: 349 EHWNQWKNEGCPEL--KRPLTS--IXXXXXXXX---------XXXXXXXXXXLTKLWN-S 394
E+WN WKNEGCP +RP + I LT+LWN +
Sbjct: 387 ENWNSWKNEGCPSFVKERPAETKPIRPSRKRQAPEDFLGKGPDRKILMGNDELTRLWNLN 446
Query: 395 KDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCP 454
DN+EACKS R+F PSLE +FEEAI+Q DPA VE++YK V +SNY WRALRLLSR+ P
Sbjct: 447 PDNMEACKSENREFMPSLEDFFEEAIEQADPANMVEDEYKVVRNSNYGWRALRLLSRRSP 506
Query: 455 HFFLNATPNVEKNSEFIENMVKRCVKEKP 483
HFF + ++++ENMV + KE P
Sbjct: 507 HFFQPTNQQFKSLADYLENMVIKLAKELP 535
Score = 564 (203.6 bits), Expect = 1.3e-54, P = 1.3e-54
Identities = 111/243 (45%), Positives = 156/243 (64%)
Query: 26 PTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSD 85
P + +KK+ L+Q++R L + I +++ + I + ++ + +C++T P +LL D
Sbjct: 40 PGNETEKKATLDQALRGVLEEQIVNQKVNVDDFLSLIYISIDGVTEGICSATTPFLLLGD 99
Query: 86 TFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRI 145
D LD+C+++F +VE NV+ WK TF+ + KN LLRMCNDLLRRLS+SQNTVFCGRI
Sbjct: 100 VLDCLPLDQCDKIFSFVEENVSTWKSNTFYSAGKNYLLRMCNDLLRRLSKSQNTVFCGRI 159
Query: 146 LLFLAKFFPFSERSGLNIISEFNVENITEFGGDEE---------------MDVSSNXXXX 190
LFLA+ FP SE+SGLN+ S+FN++NIT F +E+ MDV
Sbjct: 160 QLFLARLFPLSEKSGLNLQSQFNLDNITVFNKNEQDSTLGQQHTEVKEEGMDVEEGEMGD 219
Query: 191 XXXXXXXXXXIDFNFYKKFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDV 250
ID+N Y+KFW+LQDYFRNPVQCY+K SW F Y++ LA FKS+KLDD+
Sbjct: 220 EDAPAPSSIPIDYNLYRKFWTLQDYFRNPVQCYDKFSWMTFIKYSDEALAVFKSFKLDDM 279
Query: 251 QSS 253
Q+S
Sbjct: 280 QAS 282
>UNIPROTKB|J9NUJ3
symbol:THOC1 "Uncharacterized protein" species:9615 "Canis
lupus familiaris" [GO:0007165 "signal transduction" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986
InterPro:IPR021861 Pfam:PF11957 GeneTree:ENSGT00390000016232
EMBL:AAEX03005467 Ensembl:ENSCAFT00000045552 Uniprot:J9NUJ3
Length = 499
Score = 413 (150.4 bits), Expect = 4.7e-71, Sum P(3) = 4.7e-71
Identities = 85/158 (53%), Positives = 105/158 (66%)
Query: 159 SGLNIISEFNVENITEFGGDEE--------MDVSSNXXXXXXXXXXXXXXIDFNFYKKFW 210
S LN+ S+FN+EN+T F +E+ MDV ID+N Y+KFW
Sbjct: 93 SSLNLQSQFNLENVTVFNTNEQHTEDREEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFW 152
Query: 211 SLQDYFRNPVQCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFA 262
SLQDYFRNPVQCY K+SWK F Y+E VLA FKSYKLDD Q+S L G+ YFA
Sbjct: 153 SLQDYFRNPVQCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFA 212
Query: 263 KYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVK 300
K+LT++KL+DLQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 213 KFLTSEKLMDLQLSDSNFRRHILLQYLILFQYLKGQVK 250
Score = 228 (85.3 bits), Expect = 4.7e-71, Sum P(3) = 4.7e-71
Identities = 46/94 (48%), Positives = 61/94 (64%)
Query: 406 RDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVE 465
R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHFF +
Sbjct: 299 REYMPTLEEFFEEAIEQADPENMVENEYKAVNNSNYGWRALRLLARRSPHFFQPTNQQFK 358
Query: 466 KNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
E++ENMV + KE PS +I G D+D
Sbjct: 359 SLPEYLENMVIKLAKELPPPSEEIK---TGEDED 389
Score = 109 (43.4 bits), Expect = 4.7e-71, Sum P(3) = 4.7e-71
Identities = 20/53 (37%), Positives = 30/53 (56%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTF 114
ISL + + +C ++ P +LL D D LD+C+ +F +VE NV WK F
Sbjct: 31 ISLAIGGVTEGICTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWKSNLF 83
>UNIPROTKB|Q6TUH4
symbol:Thoc1 "LRRGT00070" species:10116 "Rattus norvegicus"
[GO:0007165 "signal transduction" evidence=IEA] InterPro:IPR000488
Pfam:PF00531 PROSITE:PS50017 SMART:SM00005 RGD:1308657
GO:GO:0007165 Gene3D:1.10.533.10 InterPro:IPR011029 SUPFAM:SSF47986
InterPro:IPR021861 Pfam:PF11957 CTD:9984 eggNOG:NOG275387
HOGENOM:HOG000008123 HOVERGEN:HBG060294 KO:K12878
GeneTree:ENSGT00390000016232 EMBL:AY387056 IPI:IPI00421329
RefSeq:NP_001041315.1 UniGene:Rn.202648 SMR:Q6TUH4 STRING:Q6TUH4
Ensembl:ENSRNOT00000046718 GeneID:291797 KEGG:rno:291797
NextBio:633190 Genevestigator:Q6TUH4 Uniprot:Q6TUH4
Length = 499
Score = 387 (141.3 bits), Expect = 3.2e-68, Sum P(3) = 3.2e-68
Identities = 77/148 (52%), Positives = 98/148 (66%)
Query: 161 LNIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQDYFRNPV 220
L +++E+ ++ +E MDV ID+N Y+KFWSLQDYFRNPV
Sbjct: 103 LGLVAEYGLDPQHTEDREEGMDVEEGEMGDDEAPTTCSIPIDYNLYRKFWSLQDYFRNPV 162
Query: 221 QCYNKVSWKMFTSYAETVLAAFKSYKLDDVQSS------LNPSGD--YFAKYLTNQKLLD 272
QCY K+SWK F Y+E VLA FKSYKLDD Q+S L G+ YFAK+LT++KL+D
Sbjct: 163 QCYEKISWKTFLKYSEEVLAVFKSYKLDDTQASRKKMEELKTGGEHVYFAKFLTSEKLMD 222
Query: 273 LQLSDTNFRRYVLLQFLILFQYFTSTVK 300
LQLSD+NFRR++LLQ+LILFQY VK
Sbjct: 223 LQLSDSNFRRHILLQYLILFQYLKGQVK 250
Score = 231 (86.4 bits), Expect = 3.2e-68, Sum P(3) = 3.2e-68
Identities = 48/103 (46%), Positives = 64/103 (62%)
Query: 397 NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRALRLLSRKCPHF 456
NL + R++ P+LE +FEEAI+Q DP VE +YK VN+SNY WRALRLL+R+ PHF
Sbjct: 290 NLTVYFTMTREYMPTLEEFFEEAIEQADPENMVESEYKAVNNSNYGWRALRLLARRSPHF 349
Query: 457 FLNATPNVEKNSEFIENMVKRCVKE--KPSSQISGNGNGVDQD 497
F + E++ENMV + KE PS +I G D+D
Sbjct: 350 FQPTNQQFKSLPEYLENMVIKLAKELPPPSEEIK---TGEDED 389
Score = 105 (42.0 bits), Expect = 3.2e-68, Sum P(3) = 3.2e-68
Identities = 19/49 (38%), Positives = 29/49 (59%)
Query: 62 ISLCVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWK 110
ISL + + +C ++ P +LL D D LD+C+ +F +VE NV WK
Sbjct: 31 ISLAIGGVTESVCTASTPFVLLGDVLDCLPLDQCDTIFTFVEKNVATWK 79
>WB|WBGene00020172
symbol:thoc-1 species:6239 "Caenorhabditis elegans"
[GO:0000347 "THO complex" evidence=ISS] InterPro:IPR021861
Pfam:PF11957 eggNOG:NOG275387 KO:K12878
GeneTree:ENSGT00390000016232 EMBL:FO081698 RefSeq:NP_493796.2
UniGene:Cel.14485 ProteinModelPortal:Q9N5E3 SMR:Q9N5E3
STRING:Q9N5E3 PaxDb:Q9N5E3 EnsemblMetazoa:T02H6.2 GeneID:173460
KEGG:cel:CELE_T02H6.2 UCSC:T02H6.2 CTD:173460 WormBase:T02H6.2
InParanoid:Q9N5E3 OMA:VEENMNE NextBio:879757 Uniprot:Q9N5E3
Length = 665
Score = 380 (138.8 bits), Expect = 1.1e-65, Sum P(2) = 1.1e-65
Identities = 89/232 (38%), Positives = 119/232 (51%)
Query: 257 SGD-YFAKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFT--STVKSRGEGLELKSDQE 313
S D +F KYLT+ KLL LQL+D++FRRY L+Q +I+FQY T S K + + L DQ
Sbjct: 285 SNDVFFTKYLTSPKLLALQLNDSSFRRYFLMQAIIIFQYLTAESRFKPPAKKMVLNEDQA 344
Query: 314 KWVKDTTETVYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLTSIXXXX 373
K+V + + Y L+ T P G F +K I+ E+ WN WKN C +
Sbjct: 345 KYVSECEDKCYRLLADTMPRGTAFVAGLKRIMLREQEWNTWKNANCADFSEKADKGAMQM 404
Query: 374 XXXXXX------XXXXXXXXLTKLW-NSKDNLEACKSAERDFTPSLESYFEEAIQQMDPA 426
LTKLW N D L+ACKS +R F P L + + I +MDP
Sbjct: 405 YKKRQRIPFNPNSLDLGTPELTKLWTNEPDVLKACKSDKRKFIPKLPDFIRDPIDEMDPE 464
Query: 427 AAVEEQYKKVNDSNYAWRALRLLSRKCPHFFLNATPNVEKNS---EFIE-NM 474
VEEQYK++NDS + WRA RLL K P + + + EF+E NM
Sbjct: 465 QQVEEQYKQINDSAFQWRAARLLMHKSPGYVTKTDTKTDPTTNIKEFLERNM 516
Score = 319 (117.4 bits), Expect = 1.1e-65, Sum P(2) = 1.1e-65
Identities = 70/202 (34%), Positives = 104/202 (51%)
Query: 69 CMK-DMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCN 127
C K +C+ P+ L D +MS++++C+Q+F VE N+N +KQ F + +NN+LR CN
Sbjct: 67 CSKLGLCSKNTPLSTLQDLLEMSSIEECKQIFSIVEENMNEFKQPGFIETAQNNILRFCN 126
Query: 128 DLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNX 187
DLLRRLSR+ T FCGRI+ FL++F P +E+SG+N + FN N+T + + E D +
Sbjct: 127 DLLRRLSRTAETSFCGRIMFFLSRFLPLTEKSGVNFMGHFNTLNVTNYD-ESETDGEALL 185
Query: 188 XXXXXXXXXXXXXIDFN-----------------FYKKFWSLQDYFRNPVQCYNKVSWKM 230
D Y++FWSLQ + NP Y K +
Sbjct: 186 AATSSAPTPTEGAEDMETGEIEEDNSKEIQVTPEMYRQFWSLQKFMSNPNSIYEKEKFLT 245
Query: 231 FTSYAETVLAAFKSYKLDDVQS 252
F + VL S KL+ + S
Sbjct: 246 FKTDLTAVLTLMTSNKLEKLSS 267
Score = 39 (18.8 bits), Expect = 7.9e-26, Sum P(2) = 7.9e-26
Identities = 14/49 (28%), Positives = 23/49 (46%)
Query: 393 NSKDNLEA-CKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSN 440
N K+ LE +A ++F ES ++ + VEE K+ DS+
Sbjct: 507 NIKEFLERNMYNAAKNFNEFKESIENREKKEAEARKKVEESLKRKLDSS 555
>TAIR|locus:2178183
symbol:THO1 "AT5G09860" species:3702 "Arabidopsis
thaliana" [GO:0003674 "molecular_function" evidence=ND] [GO:0010267
"production of ta-siRNAs involved in RNA interference"
evidence=IMP] [GO:0031047 "gene silencing by RNA" evidence=IMP]
[GO:0005634 "nucleus" evidence=IDA] [GO:0006406 "mRNA export from
nucleus" evidence=IMP] [GO:0050832 "defense response to fungus"
evidence=IMP] GO:GO:0005634 EMBL:CP002688 GO:GO:0050832
GO:GO:0006406 GO:GO:0010267 InterPro:IPR021861 Pfam:PF11957
KO:K12878 EMBL:AF424566 EMBL:AF428323 IPI:IPI00528228
RefSeq:NP_568219.1 UniGene:At.27041 PRIDE:Q93VM9
EnsemblPlants:AT5G09860.1 GeneID:830846 KEGG:ath:AT5G09860
TAIR:At5g09860 InParanoid:Q93VM9 OMA:DMDPSAG PhylomeDB:Q93VM9
ProtClustDB:CLSN2689570 Genevestigator:Q93VM9 Uniprot:Q93VM9
Length = 599
Score = 616 (221.9 bits), Expect = 3.9e-60, P = 3.9e-60
Identities = 164/494 (33%), Positives = 246/494 (49%)
Query: 38 QSIRQYLLKLIKTPDIDIKV---IENYISLCVELCMKDMCNSTLPIILLSDTFDMSTLDK 94
+ I QY +LI D D + I + + + + LC K+ + LL D +MST+
Sbjct: 62 EQIMQYG-QLIDDDDDDDDIHGQIPHLLDVVLYLCEKEHVEGGMIFQLLEDLTEMSTMKN 120
Query: 95 CEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILLFLAKFFP 154
C+ +F Y+E +I +Q F K +LR CN LLRRLS++ + VFCGRIL+FLA FFP
Sbjct: 121 CKDVFGYIESKQDILGKQELFARGKLVMLRTCNQLLRRLSKANDVVFCGRILMFLAHFFP 180
Query: 155 FSERSGLNIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXXXXIDFNFYKKFWSLQD 214
SERS +NI FN N T++ D +S +DFNFYK FWSLQ+
Sbjct: 181 LSERSAVNIKGVFNTSNETKYEKDPPKGIS----------------VDFNFYKTFWSLQE 224
Query: 215 YFRNPVQCYN-KVSWKMFTSYAETVLAAFKSYKLDDVQSSLNP----SGDYFAKYLTNQK 269
YF NP + W+ F+S VL F + L + + N + + KYLT+ K
Sbjct: 225 YFCNPASLTSASTKWQKFSSSLAVVLNTFDAQPLSEEEGEANSLEEEAATFNIKYLTSSK 284
Query: 270 LLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWVKDTTETVYSLIKQ 329
L+ L+L D++FRR++LLQ LI+F Y + K+ + L ++ +E+ +K + V L++
Sbjct: 285 LMGLELKDSSFRRHILLQCLIMFDYLRAPGKN-DKDLPSETMKEE-LKSCEDRVKKLLEI 342
Query: 330 TPPDGEHFSQVVKLILKGEEHWNQWKNEGCPEL-KRPLTSIXXXXXXXXXXXX-XXXXXX 387
TPP G+ F + V+ IL+ E++W WK +GCP K+P+
Sbjct: 343 TPPKGKEFLRAVEHILEREKNWVWWKRDGCPPFEKQPIDKKSPNAGQKKRRQRWRLGNKE 402
Query: 388 LTKLWNSKD-NLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRAL 446
L++LW D N A ++R TP + Y++ + MDP+A +E++Y N+ Y W+ L
Sbjct: 403 LSQLWRWADQNPNALTDSQRVRTPDIADYWKPLAEDMDPSAGIEDEYHHKNNRVYCWKGL 462
Query: 447 RLLSRKCPHFFLNAT--------------PNVEKNSEFIEN-MVKRCVKEKP---SSQIS 488
R +R+ F T P V + N KR KE+ S +
Sbjct: 463 RFTARQDLEGFSRFTEMGIEGVVPVELLPPEVRSKYQAKPNEKAKRAKKEETKGGSHETE 522
Query: 489 GNGNGVDQDPAEVE 502
GN GV AE E
Sbjct: 523 GNQIGVSNSEAEAE 536
>DICTYBASE|DDB_G0275717
symbol:thoc1 "putative THO1 protein (nuclear matrix
protein p84)" species:44689 "Dictyostelium discoideum" [GO:0008150
"biological_process" evidence=ND] [GO:0005575 "cellular_component"
evidence=ND] [GO:0003674 "molecular_function" evidence=ND]
dictyBase:DDB_G0275717 GenomeReviews:CM000151_GR EMBL:AAFI02000013
InterPro:IPR021861 Pfam:PF11957 eggNOG:NOG275387 KO:K12878
RefSeq:XP_643594.2 STRING:Q552T7 EnsemblProtists:DDB0233560
GeneID:8620181 KEGG:ddi:DDB_G0275717 OMA:SFSTHIN
ProtClustDB:CLSZ2848579 Uniprot:Q552T7
Length = 726
Score = 233 (87.1 bits), Expect = 2.0e-38, Sum P(3) = 2.0e-38
Identities = 76/267 (28%), Positives = 126/267 (47%)
Query: 224 NKVSWKMFTSYAETVLAAFKSY-KLDDVQ--SSLNPSGD-YFAKYLTNQKLLDLQLSDTN 279
NK+ W+ F E V+ +F ++ LD++ SS NPS YF KYLT+ L+ LQL D+
Sbjct: 321 NKIKWESFIQSLELVIGSFSTHINLDELSQSSSNNPSKKHYFTKYLTSSNLMKLQLKDSI 380
Query: 280 FRRYVLLQFLILFQYFTSTVKSRGEGLELKSD-QEKWVKDTTETVYSLIKQTPPDGEHFS 338
FR+ +L Q LI FQ T + + +D Q+ +++ T + ++ T P+GE+FS
Sbjct: 381 FRKNILTQILITFQALDLTNQKYPT---IFNDLQKNIIQELTNKCFKILSNTNPNGEYFS 437
Query: 339 QVVKLILKGEEHWNQWKNEG-CPELKRP----LTSIXXXXXXXXXXXXXXXXXXLTKLWN 393
+ ILK E++W WK + C +RP + L++LWN
Sbjct: 438 NCLSSILKREKNWIIWKRDNQCKPFERPPCSPIVKKKKLFRKTALTKISLGNQELSRLWN 497
Query: 394 SKDNLEACKSAERDFTPSLESYFE----EAIQQMDPAA--AVEEQYKKVNDSNY--AWRA 445
+ + SL+S+ E E I+Q + A++ + +K N+ A +A
Sbjct: 498 LSGAPNDRSYLKTQNSVSLDSFIEPLKKETIEQEEKTKQEALKLERRKKNEQKRSDAEKA 557
Query: 446 LRLLSRKCPHFFLNATPNVEKNSEFIE 472
R + FL A P +K+ ++ E
Sbjct: 558 RRKEYDEKKAEFLLANPT-KKDKDYQE 583
Score = 215 (80.7 bits), Expect = 2.0e-38, Sum P(3) = 2.0e-38
Identities = 63/186 (33%), Positives = 93/186 (50%)
Query: 6 QLNFDK-SQLNVVSDKYALYKPTCDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISL 64
+L F+K S L + Y K + K++++ +IR + LIK DI + I+ I L
Sbjct: 66 KLLFNKDSLLKEIQKIYPNIKDPIQIEIKTSIDLNIRIFFNNLIKQIDITYENIDLAIKL 125
Query: 65 CVELCMKDMCNSTLPIILLSDTFDMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCK--NNL 122
+ +S LP+ L D F+ T+ KC LF +E I+ Q + + N L
Sbjct: 126 AYSFVELGILDSILPLQLSEDLFETKTISKCLDLFGLLESRAEIFSQDPEIIKGRKRNLL 185
Query: 123 LRMCNDLLRRLSRSQNTVFCGRILLFLAKFFPFSERSGLNIISEFNV--ENITEFGGDEE 180
L++C +LL+R N CGRILLFLA FP S+ SGLN E N+ E +F D
Sbjct: 186 LKICIELLKR---ETNPDSCGRILLFLAYVFPLSDPSGLNTKGEHNIHPEEALDFQNDIM 242
Query: 181 MDVSSN 186
+V+ N
Sbjct: 243 NNVNGN 248
Score = 85 (35.0 bits), Expect = 2.0e-38, Sum P(3) = 2.0e-38
Identities = 24/103 (23%), Positives = 39/103 (37%)
Query: 162 NIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXX---XXIDFNFYKKFWSLQDYFRN 218
+I++ N N E+ +S +D NFY++FW LQ F+N
Sbjct: 240 DIMNNVNGNNNNNVNNSEDTTTTSTAATATVITTTNGNNDTTVDRNFYRQFWGLQTVFQN 299
Query: 219 PVQCY----------------NKVSWKMFTSYAETVLAAFKSY 245
P Q NK+ W+ F E V+ +F ++
Sbjct: 300 PQQVLLNTTVTTGGTITNITLNKIKWESFIQSLELVIGSFSTH 342
Score = 59 (25.8 bits), Expect = 3.4e-20, Sum P(3) = 3.4e-20
Identities = 17/59 (28%), Positives = 31/59 (52%)
Query: 424 DPAAAVEEQYKKVNDSN-YAWRALRLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKE 481
D + ++E + + D+ Y W+ LRL+SRK F N P K + I++ +K + +
Sbjct: 594 DDLSDLDEPKELLRDNPVYIWKTLRLISRKRLELFKN--P---KFDDIIQSFIKPTITQ 647
Score = 47 (21.6 bits), Expect = 2.4e-14, Sum P(2) = 2.4e-14
Identities = 11/31 (35%), Positives = 16/31 (51%)
Query: 228 WKMFTSYAETVLAAFKSYKLDDV-QSSLNPS 257
WK + L FK+ K DD+ QS + P+
Sbjct: 614 WKTLRLISRKRLELFKNPKFDDIIQSFIKPT 644
Score = 45 (20.9 bits), Expect = 9.2e-19, Sum P(3) = 9.2e-19
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 449 LSRKCPHFFLNATPNVEKNSEFIENMVKR 477
L+ KC N PN E S + +++KR
Sbjct: 418 LTNKCFKILSNTNPNGEYFSNCLSSILKR 446
>RGD|1308657
symbol:Thoc1 "THO complex 1" species:10116 "Rattus norvegicus"
[GO:0003677 "DNA binding" evidence=IEA] [GO:0003723 "RNA binding"
evidence=IEA] [GO:0005737 "cytoplasm" evidence=IEA] [GO:0006351
"transcription, DNA-dependent" evidence=IEA] [GO:0006355
"regulation of transcription, DNA-dependent" evidence=IEA]
[GO:0006397 "mRNA processing" evidence=IEA] [GO:0006915 "apoptotic
process" evidence=IEA] [GO:0007165 "signal transduction"
evidence=IEA] [GO:0008380 "RNA splicing" evidence=IEA] [GO:0016363
"nuclear matrix" evidence=IEA] [GO:0016607 "nuclear speck"
evidence=IEA] [GO:0051028 "mRNA transport" evidence=IEA]
InterPro:IPR000488 Pfam:PF00531 PROSITE:PS50017 SMART:SM00005
RGD:1308657 GO:GO:0005737 GO:GO:0006915 GO:GO:0007165 GO:GO:0006355
GO:GO:0008380 GO:GO:0003677 GO:GO:0016607 GO:GO:0006397
GO:GO:0006351 GO:GO:0003723 Gene3D:1.10.533.10 InterPro:IPR011029
SUPFAM:SSF47986 GO:GO:0016363 GO:GO:0051028 InterPro:IPR021861
Pfam:PF11957 eggNOG:NOG79897 GeneTree:ENSGT00390000016232
EMBL:AY325254 IPI:IPI00382382 UniGene:Rn.127881
ProteinModelPortal:P59924 SMR:P59924 PRIDE:P59924
Ensembl:ENSRNOT00000045976 UCSC:RGD:1308657 HOGENOM:HOG000202278
HOVERGEN:HBG079252 InParanoid:P59924 Genevestigator:P59924
GermOnline:ENSRNOG00000032739 InterPro:IPR013544 Pfam:PF08333
Uniprot:P59924
Length = 343
Score = 250 (93.1 bits), Expect = 4.7e-20, P = 4.7e-20
Identities = 51/113 (45%), Positives = 69/113 (61%)
Query: 388 LTKLWN-SKDNLEACKSAERDFTPSLESYFEEAIQQMDPAAAVEEQYKKVNDSNYAWRAL 446
LT+LWN DN+EACK R++ P LE +FEEAI+Q D VE +YK +N+SNY W L
Sbjct: 116 LTRLWNLCPDNMEACKLETREYMPILEEFFEEAIEQADAENMVESEYKAINNSNYGWSTL 175
Query: 447 RLLSRKCPHFFLNATPNVEKNSEFIENMVKRCVKEKP--SSQISGNGNGVDQD 497
R L+ + PHFF + +E++ENMV + KE P S +I G D+D
Sbjct: 176 RFLAWRSPHFFQPTNQQFKNMTEYLENMVIKLAKELPPHSEEIK---TGEDED 225
>POMBASE|SPCP25A2.03
symbol:SPCP25A2.03 "THO complex subunit (predicted)"
species:4896 "Schizosaccharomyces pombe" [GO:0000347 "THO complex"
evidence=ISS] [GO:0003674 "molecular_function" evidence=ND]
[GO:0005634 "nucleus" evidence=IDA] [GO:0006368 "transcription
elongation from RNA polymerase II promoter" evidence=ISS]
[GO:0006406 "mRNA export from nucleus" evidence=IC]
PomBase:SPCP25A2.03 EMBL:CU329672 GO:GO:0006368 GO:GO:0006406
InterPro:IPR021861 Pfam:PF11957 GO:GO:0000347 eggNOG:NOG275387
KO:K12878 PIR:T50450 RefSeq:NP_588092.1 STRING:Q9URT2
EnsemblFungi:SPCP25A2.03.1 GeneID:2539396 KEGG:spo:SPCP25A2.03
OMA:REANWIR OrthoDB:EOG4X3M8G NextBio:20800560 Uniprot:Q9URT2
Length = 752
Score = 262 (97.3 bits), Expect = 2.2e-19, P = 2.2e-19
Identities = 94/333 (28%), Positives = 145/333 (43%)
Query: 28 CDYDKKSALEQSIRQYLLKLIKTPDIDIKVIENYISLCVELCMKDMCNSTLPIILLSDTF 87
C Y+ E + + L L +D+ VI N I+ + C+ LP ++L +
Sbjct: 60 CCYETARKSEIGLEERLKCLFAI--LDLLVIGNEIN-------ESFCDHLLPFLILEELM 110
Query: 88 DMSTLDKCEQLFYYVEVNVNIWKQQTFFMSCKNNLLRMCNDLLRRLSRSQNTVFCGRILL 147
D+ T+++C +L+ Y E ++ K LLR+ N+LLRRLSR +N+ FCGRI +
Sbjct: 111 DIHTVNECAKLYEYFETRPSLMKGIVSNRGRGPVLLRISNELLRRLSRQENSSFCGRIDI 170
Query: 148 FLAKFFPFSERSGLNIISEFNVENITEFGGDEEMDVSSNXXXXXXXXXXXXXXIDFNFYK 207
L+K FP ERSG N+ ++N + FG E S+ F Y
Sbjct: 171 LLSKAFPPEERSGANLRGDYNT--VHSFGKVELSPPSTPISDRTDLSYHKKLNTLFTAY- 227
Query: 208 KFWSLQDYFRNPVQCYNKVSWKMFTSYAETVLAAF-----------KSYKLDDVQSSLNP 256
W LQ NP + + F A + + AF KS D SS
Sbjct: 228 --WDLQCMCSNPPKLLASDTLPKFIDAAGSAIQAFESILQNTFFNGKSNPTIDPNSSSLL 285
Query: 257 SGDYF-------AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELK 309
S Y +KY+ ++ L + QLSD +FR +LQ +I+F + K R E L
Sbjct: 286 SEKYITLDKGFPSKYIYSRSLFEYQLSDEDFRLQAILQLIIIFDFLLDHSKERIERRTL- 344
Query: 310 SDQEKWVKDTTETVYSLIKQTPPDGEHFSQVVK 342
EKW T + V ++ + D +++ K
Sbjct: 345 ---EKW---TNKAVIPIVILSDEDTSKLNELSK 371
Score = 144 (55.7 bits), Expect = 2.1e-06, P = 2.1e-06
Identities = 56/216 (25%), Positives = 90/216 (41%)
Query: 262 AKYLTNQKLLDLQLSDTNFRRYVLLQFLILFQYFTSTVKSRGEGLELKSDQEKWV----- 316
+KY+ ++ L + QLSD +FR +LQ +I+F + K R E L+ K V
Sbjct: 298 SKYIYSRSLFEYQLSDEDFRLQAILQLIIIFDFLLDHSKERIERRTLEKWTNKAVIPIVI 357
Query: 317 ---KDTTET------VYSLIKQTPPDGEHFSQVVKLILKGEEHWNQWKNEGCPELKRPLT 367
+DT++ YS + T G + +K I+ E +W WK GCP L++PL
Sbjct: 358 LSDEDTSKLNELSKEAYSFL-HTARCGS-VQRTIKEIIHIEGNWKLWKGLGCPSLEKPLV 415
Query: 368 S----------IXXXXXXXXXXXXXXXXXXLTKLWNS--KDNLEACKSAERDFTPSLESY 415
+ L++LW ++ L+ K ER PS ES+
Sbjct: 416 DKAAIDEAVEGLKKLTNTPVKLRFAMGNAALSRLWEQAGENTLDDLKKEERYRIPSPESF 475
Query: 416 FEEA-IQQMDPAAAVEEQYKKVNDSNYA---WRALR 447
+ + AV + K ++ + A WRA R
Sbjct: 476 LSGVKADKFEIEEAVRDDDKHFHEQSLATKTWRAFR 511
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
Query ----- As Used ----- ----- Computed ----
Frame MatID Matrix name Lambda K H Lambda K H
+0 0 BLOSUM62 0.319 0.134 0.401 same same same
Q=9,R=2 0.244 0.0300 0.180 n/a n/a n/a
Query
Frame MatID Length Eff.Length E S W T X E2 S2
+0 0 555 504 0.00085 119 3 11 22 0.39 34
35 0.45 37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 16
No. of states in DFA: 624 (66 KB)
Total size of DFA: 337 KB (2168 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:01
No. of threads or processors used: 24
Search cpu time: 41.62u 0.09s 41.71t Elapsed: 00:00:09
Total cpu time: 41.62u 0.09s 41.71t Elapsed: 00:00:10
Start: Thu Aug 15 11:27:30 2013 End: Thu Aug 15 11:27:40 2013
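
The bit scores and percentages printed in the alignment headers above can be reproduced from the Parameters table. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul conversion, bits = (Lambda * S - ln K) / ln 2, applied with the gapped "As Used" row (Q=9,R=2: Lambda = 0.244, K = 0.0300). That choice of row is an inference checked only against the scores shown in this report. The helper names bit_score and percent are hypothetical. The percentages appear to be truncated rather than rounded (for example 246/494 is printed as 49%).

```python
import math

# Gapped (Q=9, R=2) "As Used" values copied from the Parameters table.
# Assumption: these, not the ungapped BLOSUM62 row, drive the printed bit scores.
LAMBDA = 0.244
K = 0.0300


def bit_score(raw_score, lam=LAMBDA, k=K):
    """Convert a raw alignment score S to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def percent(matches, length):
    """Integer percentage, truncated toward zero as the report appears to do."""
    return 100 * matches // length


if __name__ == "__main__":
    # Raw scores taken from "Score = ..." lines in the report.
    for raw in (564, 413, 250, 39):
        print(raw, round(bit_score(raw), 1))
    # Identities / Positives from the 243-column HSP.
    print(percent(111, 243), percent(156, 243))
```

Running the script prints each raw score next to its recomputed bit score, which can be compared with the "(… bits)" value on the corresponding Score line.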