Your job contains 1 sequence.
>003896
MSRLGYKNIVYHGDVCLGELDTILVSDENFQFPNNEIRIRHISPSSERCIPLSILHTISS
FSLRCKLESSAPVEQPHLINLHASCFYEFKTAVVVIGDEEIHLVAMPSKQKKFPCFWCYS
VSSGLYNSCLGMLNLRCLAIVFDLDETLIVANTMKSFEDRIEALRSWIAREPDQIRASGM
SAELKRYMDDRTLLKQYTENDCVMDNGKVFKVQQEEVPPPSENHERIVRPVIRIPERNLV
LTRINPENRDTSVLVRLRPAWEDLRSYLIAKGRKRFEVYVCTMAERDYALEMWRLLDPEG
HLIGSKQLLDRVVCVKSGSRKSLLNVFQRGLCHPKMAMVIDDRCKVWEDKDQPRVHVVPA
FTPYYAPQAETANAVPVLCVARNVACNVRGCFFKEFDENLLRSISEVFYEDEAVNLPAAP
DVSNYLMSEDANFAPNGSTNAPMSEGLNGLEVERRLNQSDEKYVVDSGLPSMKNSSDLKS
ETSLLPVAVASNATVPATVVPSQKPGLLGAPIRRDNSSMKHGFDLRNQNSAQPPLPKLHG
QGGWIVEEEVNRVLPNNRPVSIATGLPSHASQAKGEEAIMAHDLHKQNLPPASQPPEIGV
SQNHVSSNSREFLTEGGKTNLLPSYLSIGVLQEIGKRCSSKVEFRSVVSTSKDLQFSVEV
LFTGEKIGVGMGKTRKDAQQQAAENALHYLAEKYVAYITPRSGAMDRDFDKLSLENENGF
LWDTIISESNEGLREDGLRKESTPEASEVEPGSTYASLGNQQVQKRPNVPKLSKLIPSKR
LKDEPPQT
BLASTP 2.0MP-WashU [04-May-2006] [linux26-i686-ILP32F64 2006-05-09T11:47:08]
Copyright (C) 1996-2006 Washington University, Saint Louis, Missouri USA.
All Rights Reserved.
Reference: Gish, W. (1996-2006) http://blast.wustl.edu
Query= 003896
(788 letters)
Database: go_20130330-seqdb.fasta
368,745 sequences; 169,044,731 total letters.
Searching....10....20....30....40....50....60....70....80....90....100% done
                                                                     Smallest
                                                                        Sum
                                                              High  Probability
Sequences producing High-scoring Segment Pairs:              Score  P(N)      N
TAIR|locus:2119053 - symbol:CPL1 "C-terminal domain phosp...  1329  1.1e-159  5
UNIPROTKB|K7EJD2 - symbol:CTDP1 "RNA polymerase II subuni...   112  0.00025   4
UNIPROTKB|Q9Y5B0 - symbol:CTDP1 "RNA polymerase II subuni...   112  0.00056   4
>TAIR|locus:2119053
symbol:CPL1 "C-terminal domain phosphatase-like 1"
species:3702 "Arabidopsis thaliana" [GO:0003725 "double-stranded
RNA binding" evidence=IEA;ISS] [GO:0005622 "intracellular"
evidence=IEA] [GO:0005634 "nucleus" evidence=ISM;IDA] [GO:0009651
"response to salt stress" evidence=IMP] [GO:0009738 "abscisic acid
mediated signaling pathway" evidence=IMP] [GO:0019204 "nucleotide
phosphatase activity" evidence=IDA] [GO:0045892 "negative
regulation of transcription, DNA-dependent" evidence=IMP]
[GO:0004647 "phosphoserine phosphatase activity" evidence=IDA]
[GO:0016791 "phosphatase activity" evidence=IDA] [GO:0009611
"response to wounding" evidence=IMP] InterPro:IPR001159
InterPro:IPR004274 Pfam:PF00035 Pfam:PF03031 PROSITE:PS50137
PROSITE:PS50969 SMART:SM00358 SMART:SM00577 InterPro:IPR014720
GO:GO:0007275 GO:GO:0005634 GO:GO:0045892 EMBL:CP002687
GenomeReviews:CT486007_GR GO:GO:0009611 GO:GO:0009738 GO:GO:0046872
Gene3D:3.40.50.1000 InterPro:IPR023214 SUPFAM:SSF56784
Gene3D:3.30.160.20 GO:GO:0006351 GO:GO:0004721 GO:GO:0003725
EMBL:AL161555 EMBL:AL035527 EMBL:AY557186 EMBL:AK221944
EMBL:AK229289 IPI:IPI00518313 PIR:T05842 RefSeq:NP_193898.3
UniGene:At.2700 UniGene:At.32611 ProteinModelPortal:Q5YDB6
SMR:Q5YDB6 IntAct:Q5YDB6 STRING:Q5YDB6 PaxDb:Q5YDB6 PRIDE:Q5YDB6
EnsemblPlants:AT4G21670.1 GeneID:828254 KEGG:ath:AT4G21670
TAIR:At4g21670 eggNOG:NOG244866 HOGENOM:HOG000078695
InParanoid:Q5YDB6 OMA:SFDGMAD PhylomeDB:Q5YDB6
ProtClustDB:CLSN2681612 Genevestigator:Q5YDB6 GO:GO:0019204
GO:GO:0004647 GO:GO:0009628 Uniprot:Q5YDB6
Length = 967
Score = 1329 (472.9 bits), Expect = 1.1e-159, Sum P(5) = 1.1e-159
Identities = 275/481 (57%), Positives = 345/481 (71%)
Query: 37 IRIRHISPSSERCIPLSILHTISSFSLRCKLESSAPVEQPHLINLHASCFYEFKTAVVVI 96
IRI H S S ERC PL+IL TISS L KLE+S Q L ++SC + KTAV+++
Sbjct: 53 IRISHFSQSGERCPPLAILTTISSCGLCFKLEASPSPAQESLSLFYSSCLRDNKTAVMLL 112
Query: 97 GDEEIHLVAMPSKQKKF--PCFWCYSVSSGLYNSCLGMLNLRCLAIVFDLDETLIVANTM 154
G EE+HLVAM S+ K PCFW +SV+ G+Y+SCL MLNLRCL IVFDLDETL+VANTM
Sbjct: 113 GGEELHLVAMYSENIKNDRPCFWAFSVAPGIYDSCLVMLNLRCLGIVFDLDETLVVANTM 172
Query: 155 KSFEDRIEALRSWIAREPDQIRASGMSAELKRYMDDRTLLKQYTENDCVMDNGKVFKVQQ 214
+SFED+I+ + I E D R + + AE+KRY DD+ LLKQY E+D V++NG+V KVQ
Sbjct: 173 RSFEDKIDGFQRRINNEMDPQRLAVIVAEMKRYQDDKNLLKQYIESDQVVENGEVIKVQS 232
Query: 215 EEVPPPSENHXXXXXXXXXXXXXNLVLTRINPENRDTSVLVRLRPAWEDLRSYLIAKGRK 274
E VP S+NH N++LTRINP RDTSVLVR+RP+WE+LRSYL AKGRK
Sbjct: 233 EIVPALSDNHQPLVRPLIRLQEKNIILTRINPMIRDTSVLVRMRPSWEELRSYLTAKGRK 292
Query: 275 RFEVYVCTMAERDYALEMWRLLDPEGHLIGSKQLLDRVVCVKSGSRKSLLNVFQRGLCHP 334
RFEVYVCTMAERDYALEMWRLLDPEG+LI + LL R+VCVKSG +KSL NVF G CHP
Sbjct: 293 RFEVYVCTMAERDYALEMWRLLDPEGNLINTNDLLARIVCVKSGFKKSLFNVFLDGTCHP 352
Query: 335 KMAMVIDDRCKVWEDKDQPRVHVVPAFTPYYAPQAETANAVPVLCVARNVACNVRGCFFK 394
KMA+VIDDR KVW++KDQPRVHVVPAF PYY+PQAE A A PVLCVARNVAC VRG FF+
Sbjct: 353 KMALVIDDRLKVWDEKDQPRVHVVPAFAPYYSPQAEAA-ATPVLCVARNVACGVRGGFFR 411
Query: 395 EFDENLLRSISEVFYEDEAVNLPAAPDVSNYLMSEDANFAPNGSTNAPMS-EGLNGLEVE 453
+FD++LL I+E+ YE++A ++P+ PDVS+YL+SED NG+ + P+S +G+ EVE
Sbjct: 412 DFDDSLLPRIAEISYENDAEDIPSPPDVSHYLVSEDDTSGLNGNKD-PLSFDGMADTEVE 470
Query: 454 RRLNQSDEKYVVDSGLPSMKNSSDLKSETSLLPVAVASNATVPATV-VPSQ--KPGLLGA 510
RRL ++ + LP+ + + P+A AS+ +VP V V Q +P +
Sbjct: 471 RRLKEAIS--ASSAVLPAANIDPRIAAPVQF-PMASASSVSVPVPVQVVQQAIQPSAMAF 527
Query: 511 P 511
P
Sbjct: 528 P 528
Score = 156 (60.0 bits), Expect = 1.1e-159, Sum P(5) = 1.1e-159
Identities = 45/152 (29%), Positives = 79/152 (51%)
Query: 569 HASQAKGEEAIMAHD-LHKQNLPPASQP---PEIGVSQNHVSSNSREFLTEGGKTNLLPS 624
H ++ +E++ + L N P S P + +Q+ ++ +FL E ++ +
Sbjct: 668 HENRRPPKESLRRDEQLRSNNNLPDSHPFYGEDASWNQSSSRNSDLDFLPE---RSVSAT 724
Query: 625 YLSIGVLQEIGKRCSSKVEFRSVVSTSKDLQFSVEVLFTGEKIGVGMGKTRKDAQQQAAE 684
S VL I +C +KVE++ + +S DL+FSVE + +KIG G+GK+R++A +AAE
Sbjct: 725 ETSADVLHGIAIKCGAKVEYKPSLVSSTDLRFSVEAWLSNQKIGEGIGKSRREALHKAAE 784
Query: 685 NALHYLAEKYVAYITPRSGAMDRDFDKLSLEN 716
++ LA+ Y+ G RD + EN
Sbjct: 785 ASIQNLADGYMR-ANGDPGPSHRDATPFTNEN 815
Score = 69 (29.3 bits), Expect = 5.9e-147, Sum P(4) = 5.9e-147
Identities = 25/91 (27%), Positives = 41/91 (45%)
Query: 627 SIGVLQEIGKRCSSKVEFRSVVSTSKDL----QFSVEVLFTGEKIGVGMGKTRKDAQQQA 682
SI L+E+ ++ F+S D+ + +V G +G G+G T +A+ QA
Sbjct: 856 SITALRELCASEGLEMAFQSQRQLPSDMVHRDELHAQVEIDGRVVGEGVGSTWDEARMQA 915
Query: 683 AENALHYLAEKYVAYITPRSGAMDRDFDKLS 713
AE AL + + R G+ R F +S
Sbjct: 916 AERALSSVRSMLGQPLHKRQGS-PRSFGGMS 945
Score = 43 (20.2 bits), Expect = 1.1e-159, Sum P(5) = 1.1e-159
Identities = 14/44 (31%), Positives = 23/44 (52%)
Query: 741 ESTPEASEVEPGSTYASLGNQQVQKRPNVPKLSKLIPSKRLKDE 784
E+ +A+E S + LG Q + KR P+ + +KRLK +
Sbjct: 910 EARMQAAERALSSVRSMLG-QPLHKRQGSPRSFGGMSNKRLKPD 952
Score = 43 (20.2 bits), Expect = 1.1e-159, Sum P(5) = 1.1e-159
Identities = 14/50 (28%), Positives = 24/50 (48%)
Query: 513 RRDNSSMKHGFDLRNQNSAQPPLPK----------LHGQGGWI-VEEEVN 551
RR ++HG D R+ ++P P+ + + GW VEEE++
Sbjct: 573 RRRLLILQHGQDTRDPAPSEPSFPQRPPVQAPPSHVQSRNGWFPVEEEMD 622
Score = 42 (19.8 bits), Expect = 1.1e-159, Sum P(5) = 1.1e-159
Identities = 8/12 (66%), Positives = 10/12 (83%)
Query: 10 VYHGDVCLGELD 21
V+HGD LGEL+
Sbjct: 9 VFHGDGRLGELE 20
>UNIPROTKB|K7EJD2
symbol:CTDP1 "RNA polymerase II subunit A C-terminal domain
phosphatase" species:9606 "Homo sapiens" [GO:0004721
"phosphoprotein phosphatase activity" evidence=IEA] [GO:0005634
"nucleus" evidence=IEA] InterPro:IPR004274 InterPro:IPR011947
Pfam:PF03031 PROSITE:PS50969 SMART:SM00577 InterPro:IPR001357
Gene3D:3.40.50.1000 InterPro:IPR023214 SUPFAM:SSF56784
SMART:SM00292 SUPFAM:SSF52113 PROSITE:PS50172 TIGRFAMs:TIGR02250
EMBL:AC021594 EMBL:AC068473 HGNC:HGNC:2498 InterPro:IPR015388
Pfam:PF09309 Ensembl:ENST00000591598 Uniprot:K7EJD2
Length = 799
Score = 112 (44.5 bits), Expect = 0.00025, Sum P(4) = 0.00025
Identities = 41/122 (33%), Positives = 58/122 (47%)
Query: 256 RLRPAWEDLRSYLIAKGRKRFEVYVCTMAERDYALEMWRLLDPEGHLIGSKQLLDRVVCV 315
RLRP +D + K K +E++V T R YA + LDPE L S ++L R C+
Sbjct: 156 RLRPHCKDF----LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLF-SHRILSRDECI 210
Query: 316 KSGSRK-SLLNVFQRGLCHPKMAMVIDDRCKVWEDKDQPRVHVVPAFTPYYAPQAETANA 374
S+ +L N+F C M +IDDR VW K P + V + Y+ + NA
Sbjct: 211 DPFSKTGNLRNLFP---CGDSMVCIIDDREDVW--KFAPNLITVKKYV-YFQGTGDM-NA 263
Query: 375 VP 376
P
Sbjct: 264 PP 265
Score = 69 (29.3 bits), Expect = 0.00025, Sum P(4) = 0.00025
Identities = 31/95 (32%), Positives = 40/95 (42%)
Query: 435 PNGSTNAPMSEGLNGLEVE-RRLNQSDEKYVVDSGLPSMKNSSDL----KSETSLLPVAV 489
P G T AP E NGLE R LN S+ DS P + D+ ++ TS +A
Sbjct: 294 PEGVTQAPGVEPSNGLEKPARELNGSEAATPRDSPRPGKPDERDIWPPAQAPTSSQELAG 353
Query: 490 A----SNATVPATVVPSQKP--GLLGAPIRRDNSS 518
A + V P Q+P G G + D SS
Sbjct: 354 APEPQGSCAQGGRVAPGQRPAQGATGTDLDFDLSS 388
Score = 40 (19.1 bits), Expect = 0.00025, Sum P(4) = 0.00025
Identities = 7/24 (29%), Positives = 16/24 (66%)
Query: 760 NQQVQKRPNVPKLSKLIPSKRLKD 783
N+++++ P++ K+ + SK L D
Sbjct: 545 NKEIEEAPDIRKIVPELKSKVLAD 568
Score = 39 (18.8 bits), Expect = 0.00025, Sum P(4) = 0.00025
Identities = 8/14 (57%), Positives = 11/14 (78%)
Query: 136 RCLAIVFDLDETLI 149
R L ++ DLD+TLI
Sbjct: 113 RKLVLMVDLDQTLI 126
>UNIPROTKB|Q9Y5B0
symbol:CTDP1 "RNA polymerase II subunit A C-terminal domain
phosphatase" species:9606 "Homo sapiens" [GO:0005737 "cytoplasm"
evidence=IEA] [GO:0007067 "mitosis" evidence=IEA] [GO:0051301 "cell
division" evidence=IEA] [GO:0008420 "CTD phosphatase activity"
evidence=IDA] [GO:0006470 "protein dephosphorylation" evidence=IDA]
[GO:0005813 "centrosome" evidence=IDA] [GO:0005819 "spindle"
evidence=IDA] [GO:0000922 "spindle pole" evidence=IDA] [GO:0051233
"spindle midzone" evidence=IDA] [GO:0030496 "midbody" evidence=IDA]
[GO:0010458 "exit from mitosis" evidence=IMP] [GO:0003899
"DNA-directed RNA polymerase activity" evidence=TAS] [GO:0005654
"nucleoplasm" evidence=TAS] [GO:0006366 "transcription from RNA
polymerase II promoter" evidence=TAS] [GO:0006368 "transcription
elongation from RNA polymerase II promoter" evidence=TAS]
[GO:0010467 "gene expression" evidence=TAS] [GO:0016032 "viral
reproduction" evidence=TAS] [GO:0050434 "positive regulation of
viral transcription" evidence=TAS] [GO:0005634 "nucleus"
evidence=IDA] [GO:0005730 "nucleolus" evidence=IDA] [GO:0015629
"actin cytoskeleton" evidence=IDA] Reactome:REACT_71
InterPro:IPR004274 InterPro:IPR011947 Pfam:PF03031 PROSITE:PS50969
SMART:SM00577 InterPro:IPR001357 GO:GO:0005737 GO:GO:0005813
Reactome:REACT_116125 GO:GO:0005654 GO:GO:0016032 GO:GO:0006470
GO:GO:0051301 GO:GO:0007067 GO:GO:0010458 GO:GO:0015629
Gene3D:3.40.50.1000 InterPro:IPR023214 SUPFAM:SSF56784
GO:GO:0051233 EMBL:CH471117 GO:GO:0006368 GO:GO:0030496
GO:GO:0000922 SMART:SM00292 SUPFAM:SSF52113 PROSITE:PS50172
Reactome:REACT_1788 GO:GO:0050434 Reactome:REACT_1892
eggNOG:COG5190 TIGRFAMs:TIGR02250 EMBL:AF081287 EMBL:AF154115
EMBL:AC021594 EMBL:AC068473 EMBL:BC015010 EMBL:BC052576
EMBL:BC063447 IPI:IPI00410256 IPI:IPI00410257 IPI:IPI00410258
IPI:IPI01008810 RefSeq:NP_001189433.1 RefSeq:NP_004706.3
RefSeq:NP_430255.2 UniGene:Hs.465490 UniGene:Hs.734021 PDB:1J2X
PDB:1ONV PDB:2K7L PDBsum:1J2X PDBsum:1ONV PDBsum:2K7L
DisProt:DP00177 ProteinModelPortal:Q9Y5B0 SMR:Q9Y5B0 DIP:DIP-41788N
IntAct:Q9Y5B0 MINT:MINT-275991 STRING:Q9Y5B0 PhosphoSite:Q9Y5B0
DMDM:46396052 PaxDb:Q9Y5B0 PRIDE:Q9Y5B0 DNASU:9150
Ensembl:ENST00000075430 Ensembl:ENST00000299543 GeneID:9150
KEGG:hsa:9150 UCSC:uc002lnh.2 UCSC:uc002lni.2 CTD:9150
GeneCards:GC18P077494 HGNC:HGNC:2498 HPA:CAB032641 HPA:HPA040394
MIM:604168 MIM:604927 neXtProt:NX_Q9Y5B0 Orphanet:48431
PharmGKB:PA27001 HOGENOM:HOG000112039 HOVERGEN:HBG051213
InParanoid:Q9Y5B0 KO:K15732 OMA:EAPDIRK OrthoDB:EOG4HMJ8T
PhylomeDB:Q9Y5B0 ChiTaRS:CTDP1 EvolutionaryTrace:Q9Y5B0
GenomeRNAi:9150 NextBio:34327 Bgee:Q9Y5B0 CleanEx:HS_CTDP1
Genevestigator:Q9Y5B0 GermOnline:ENSG00000060069 GO:GO:0008420
GO:GO:0003899 InterPro:IPR015388 Pfam:PF09309 Uniprot:Q9Y5B0
Length = 961
Score = 112 (44.5 bits), Expect = 0.00056, Sum P(4) = 0.00056
Identities = 41/122 (33%), Positives = 58/122 (47%)
Query: 256 RLRPAWEDLRSYLIAKGRKRFEVYVCTMAERDYALEMWRLLDPEGHLIGSKQLLDRVVCV 315
RLRP +D + K K +E++V T R YA + LDPE L S ++L R C+
Sbjct: 224 RLRPHCKDF----LEKIAKLYELHVFTFGSRLYAHTIAGFLDPEKKLF-SHRILSRDECI 278
Query: 316 KSGSRK-SLLNVFQRGLCHPKMAMVIDDRCKVWEDKDQPRVHVVPAFTPYYAPQAETANA 374
S+ +L N+F C M +IDDR VW K P + V + Y+ + NA
Sbjct: 279 DPFSKTGNLRNLFP---CGDSMVCIIDDREDVW--KFAPNLITVKKYV-YFQGTGDM-NA 331
Query: 375 VP 376
P
Sbjct: 332 PP 333
Score = 69 (29.3 bits), Expect = 0.00056, Sum P(4) = 0.00056
Identities = 31/95 (32%), Positives = 40/95 (42%)
Query: 435 PNGSTNAPMSEGLNGLEVE-RRLNQSDEKYVVDSGLPSMKNSSDL----KSETSLLPVAV 489
P G T AP E NGLE R LN S+ DS P + D+ ++ TS +A
Sbjct: 362 PEGVTQAPGVEPSNGLEKPARELNGSEAATPRDSPRPGKPDERDIWPPAQAPTSSQELAG 421
Query: 490 A----SNATVPATVVPSQKP--GLLGAPIRRDNSS 518
A + V P Q+P G G + D SS
Sbjct: 422 APEPQGSCAQGGRVAPGQRPAQGATGTDLDFDLSS 456
Score = 40 (19.1 bits), Expect = 0.00056, Sum P(4) = 0.00056
Identities = 7/24 (29%), Positives = 16/24 (66%)
Query: 760 NQQVQKRPNVPKLSKLIPSKRLKD 783
N+++++ P++ K+ + SK L D
Sbjct: 613 NKEIEEAPDIRKIVPELKSKVLAD 636
Score = 39 (18.8 bits), Expect = 0.00056, Sum P(4) = 0.00056
Identities = 8/14 (57%), Positives = 11/14 (78%)
Query: 136 RCLAIVFDLDETLI 149
R L ++ DLD+TLI
Sbjct: 181 RKLVLMVDLDQTLI 194
Parameters:
V=100
filter=SEG
E=0.001
ctxfactor=1.00
  Query                        -----  As Used  -----    -----  Computed  ----
  Frame  MatID Matrix name     Lambda    K       H      Lambda    K       H
   +0      0   BLOSUM62        0.316   0.132   0.389    same    same    same
               Q=9,R=2         0.244   0.0300  0.180     n/a     n/a     n/a
  Query
  Frame  MatID  Length  Eff.Length     E     S  W   T   X   E2     S2
   +0      0      788       775   0.00093  121  3  11  22  0.42    34
                                                       37  0.45    37
Statistics:
Database: /share/blast/go-seqdb.fasta
Title: go_20130330-seqdb.fasta
Posted: 5:47:42 AM PDT Apr 1, 2013
Created: 5:47:42 AM PDT Apr 1, 2013
Format: XDF-1
# of letters in database: 169,044,731
# of sequences in database: 368,745
# of database sequences satisfying E: 3
No. of states in DFA: 622 (66 KB)
Total size of DFA: 392 KB (2192 KB)
Time to generate neighborhood: 0.00u 0.00s 0.00t Elapsed: 00:00:00
No. of threads or processors used: 24
Search cpu time: 74.83u 0.10s 74.93t Elapsed: 00:00:04
Total cpu time: 74.83u 0.10s 74.93t Elapsed: 00:00:04
Start: Fri May 10 04:04:07 2013 End: Fri May 10 04:04:11 2013