BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003705
(801 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q84JU6|HOS1_ARATH E3 ubiquitin-protein ligase HOS1 OS=Arabidopsis thaliana GN=HOS1
PE=1 SV=1
Length = 927
Score = 749 bits (1935), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/793 (50%), Positives = 528/793 (66%), Gaps = 53/793 (6%)
Query: 1 MDEAAVSSDPVVAFLLDEVVVKDWCKRAFKNIIAELKLIYNLEVEVMKTRLSLLLKFQMK 60
MDE AVSSDPV+AFLLDEVVVKDW KR F++ +AEL+ IYNLE + M+ L LL+ +
Sbjct: 167 MDETAVSSDPVIAFLLDEVVVKDWVKRTFRSTLAELQEIYNLETKEMQAWLDKLLRCSKQ 226
Query: 61 LRDISSVIEVLASSFKDDLSAQVHDLHHFQESILKTKQHLEIMMWCAKQQFLENVRSRHA 120
+ I SV+EV+ S+FK +S Q+ D+ +E+I KTKQHL+IM+WC + FL++VRSR++
Sbjct: 227 VAGICSVLEVMESAFKGSVSPQLQDVQTLRENIGKTKQHLDIMVWCIRHGFLDDVRSRYS 286
Query: 121 SFTSWHSLVRQRKSAATERAWYDPVNYSAESTKQDGSLFIEDALANLEIEQEFTQGRGEE 180
+FTSW++LV +RKS A +RAW D V+ S++ + Q SLFIEDAL NLE E E++Q G +
Sbjct: 287 NFTSWNALVGERKSNAVKRAWPDAVDQSSDCSVQSASLFIEDALENLEREPEYSQEIGAD 346
Query: 181 LDITSLHKDDEGSSFVRSKIEGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLF 240
L++ L KD SF+RSKIEG SG YPFENLR A D+LFLHG SDLV+AKQAIFLYYLF
Sbjct: 347 LEVGRLQKDKR--SFLRSKIEGTSGSYPFENLRTAADMLFLHGGSDLVVAKQAIFLYYLF 404
Query: 241 DRHWTMPDENWRHIVDDFAATFSITRHSLLESLTFYLLDDQADEALQEACHLLPEISGPT 300
DRHWT P++ W+H +DDFAATF ITRHSLLES FYLLDD ++EALQEAC +LPEI GP
Sbjct: 405 DRHWTTPEKYWKHTIDDFAATFGITRHSLLESFVFYLLDDHSEEALQEACRILPEICGPE 464
Query: 301 THPKIAQVLLERENPEAALMVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLTEAFTYQR 360
T+PK+AQVLLER+NPE ALMVLRWSGRDG S LVS+ EAVTA+R+RVEC LL+EAFTYQR
Sbjct: 465 TYPKVAQVLLERDNPETALMVLRWSGRDGVSELVSIGEAVTALRVRVECGLLSEAFTYQR 524
Query: 361 MLCTKVREKKLKFGTIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWN 420
LC KV+E LK G + DDL +W +W+E+LV E CCL IRRNLVDR+IELPWN
Sbjct: 525 TLCLKVKENNLKNGAVKHASDDLD--IWSWTEWMEILVNEFCCLSIRRNLVDRIIELPWN 582
Query: 421 TDEEKYLHKCLLDSATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSVEQDFISKNPV 480
DEEKYLH+CLLDSATDDPS+ VGSLLVVFY+QRYRY +AYQV+L+LQ +E+ F+S N +
Sbjct: 583 PDEEKYLHRCLLDSATDDPSSAVGSLLVVFYLQRYRYIQAYQVDLRLQKIEEAFVSDNQI 642
Query: 481 SEEVLSRMQSQIHWRTKFIDTSIELLPEVQRQLLKNGKLPLNALNSSEEVEIPEKSDLHG 540
EEV+ RM+SQ HWR + +D +I++LP +Q+Q +++G+ + E +KSDL
Sbjct: 643 GEEVMFRMRSQSHWRKELVDRAIDILPVIQQQQVRSGQFS----EMEDASEGAKKSDLPD 698
Query: 541 SQELKSITLLIPTTADSSLLLPTSNLTPANSSVFE--SPTGPGRSIKSPHFEVGHYGPSI 598
+ ++ IT +P +S+ L ++N A V SP PG I G+ +
Sbjct: 699 APDM--ITSSVPFATTNSVFLQSANNARAREPVANNGSPFQPGHMI-------GNASHDL 749
Query: 599 LHERLFMN-KEGSTYDF-GVSKEFKVDGFSTPGVCQSSPMNQTPLKGRNFSSRTLSNSHR 656
H RLF N G + V+K K S TP K N R NS
Sbjct: 750 SHGRLFTNANRGQKSEVRSVTKNLKFGEMS------------TPFKDLN---RARGNSQL 794
Query: 657 RDKVSDKISPVPEQNGFLSQHLNTIHHYSHRMTTNPASTPVSNRGLHNDLAGDLH---SN 713
+ K +++ SP + ++ ++++ Y R+T N PV+ + N L G S
Sbjct: 795 QGKRTEESSPEVNVDRYIENNMSS--PYLRRITAN---NPVTVKSSSNHLNGSSQKPEST 849
Query: 714 LSSKRVHSDREDGLRYMISSEDPMDVSLS---NGKKGFAVEDRQAIAGGGLRWRSDETSD 770
R+ D+++ + +DPMD+S S N A E R GGLRWRSDETSD
Sbjct: 850 FFGTRMQPDKDN----FVDLDDPMDMSSSLKDNNNNVLATESRN--NSGGLRWRSDETSD 903
Query: 771 EEEKQSPESAMGV 783
+E++ + +M V
Sbjct: 904 DEDELTSFGSMPV 916
>sp|Q8WYP5|ELYS_HUMAN Protein ELYS OS=Homo sapiens GN=AHCTF1 PE=1 SV=3
Length = 2266
Score = 62.0 bits (149), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 93/416 (22%), Positives = 175/416 (42%), Gaps = 70/416 (16%)
Query: 201 EGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAA 260
EG +G YP +L A +D+ L G ++ AK +I +Y L D ++ P++ I + F
Sbjct: 744 EGGTGKYPPASLHAVLDMYLLDGVTEA--AKHSITIYLLLDIMYSFPNKTDTPI-ESFPT 800
Query: 261 TFSITRHSLLESLTFYLLD-DQADEALQEACHLLPEISGPTT--HPKIAQVLLERENPEA 317
F+I+ + F+L+D + + L H P + P + H KI Q + +
Sbjct: 801 VFAISWGQVKLIQGFWLIDHNDYESGLDLLFH--PATAKPLSWQHSKIIQAFMSQGEHRQ 858
Query: 318 AL---MVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLTEAFTYQRMLCTKVREKKLKFG 374
AL ++ + G +++ L+ V + C + EA+ + R C ++
Sbjct: 859 ALRYIQTMKPTVSSGNDVILHLT-----VLLFNRC--MVEAWNFLRQHCNRLN------- 904
Query: 375 TIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLHKCLLDS 434
+E L+ + +C L++ +++LP+ E++ L K L S
Sbjct: 905 -------------------IEELLKHMYEVCQEMGLMEDLLKLPFTDTEQECLVKFLQSS 945
Query: 435 ATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQ-SVEQDFISKNPVSEEVLSRMQSQIH 493
A S L+V ++QR Y A ++N L+ +V D ++P R++ +
Sbjct: 946 A----SVQNHEFLLVHHLQRANYVPALKLNQTLKINVMND---RDP-------RLRERSL 991
Query: 494 WRTKFIDTSIELLPEVQRQLLKNGKLPLNALNSSEEVEIPEKSDLHG-SQELKSITLLIP 552
R +D ++LP V R+L P + SS + L +++ + T+L
Sbjct: 992 ARNSILDQYGKILPRVHRKLAIERAKPYHLSTSSVFRLVSRPKPLSAVPKQVVTGTVLTR 1051
Query: 553 TTADSSLLL----------PTSNLTPANSSVFESPTGPGRSIKSPHFEVGHYGPSI 598
+ +++L P ++ TP NSS E P+ S+ +P +G I
Sbjct: 1052 SVFINNVLSKIGEVWASKEPINSTTPFNSSKIEEPSPIVYSLPAPELPEAFFGTPI 1107
>sp|Q5U249|ELYS_XENLA Protein ELYS OS=Xenopus laevis GN=ahctf1 PE=1 SV=1
Length = 2408
Score = 60.8 bits (146), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 75/324 (23%), Positives = 139/324 (42%), Gaps = 62/324 (19%)
Query: 198 SKIEGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDD 257
S+ + +G YP NL A +D+ L + ++ +K AI +Y+L D ++ PD+ I +
Sbjct: 739 SRDDNGTGKYPPANLHALLDVYLLENADEM--SKHAITIYFLLDIMYSFPDKPDSSI-ES 795
Query: 258 FAATFSITRHSLLESLTFYLLDDQADEALQEACHLLPEISGPTT--HPKIAQVLLERENP 315
F F + SL++ + + L D D C L P S + H +I + LL +
Sbjct: 796 FPTAFFVP-GSLIKLIQGFWLLDHNDYQNSVDCILNPASSRVMSWQHSQIIENLLCHGDS 854
Query: 316 EAALMVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLT------EAFTYQRMLCTKVREK 369
AL L+ V A T+ +++ +L EA+ QR+ +++
Sbjct: 855 RQALRYLQ----------VMKPVATTSKEVKLHMTVLLANRSILEAWNLQRLHSSRLN-- 902
Query: 370 KLKFGTIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLHK 429
+E L+ + +C L++ +++L + E+ YLHK
Sbjct: 903 ------------------------VEELLKHMYEMCQEMGLIEELLKLTFTDFEQGYLHK 938
Query: 430 CLLDSATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSVEQDFISKNPVSEEVLSRMQ 489
L + + LL+V ++QR Y A Q+N QS++ + ++ + R++
Sbjct: 939 FLQTTGVQN-----QELLLVHHLQRANYISALQLN---QSLKTNHLN------DCDRRLR 984
Query: 490 SQIHWRTKFIDTSIELLPEVQRQL 513
+ R +D ++LP VQR L
Sbjct: 985 ERSGARNAILDQYGKILPRVQRTL 1008
>sp|Q8CJF7|ELYS_MOUSE Protein ELYS OS=Mus musculus GN=Ahctf1 PE=1 SV=1
Length = 2243
Score = 52.0 bits (123), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/316 (22%), Positives = 131/316 (41%), Gaps = 51/316 (16%)
Query: 201 EGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAA 260
EG +G YP ++ A +DI L ++ +K AI +Y L D ++ P++ I + F
Sbjct: 744 EGGTGRYPPASIHALLDIYLLDNITEA--SKHAITIYLLLDIMYSFPNKTDTPI-ESFPT 800
Query: 261 TFSITRHSLLESLTFYLLD-DQADEALQEACHLLPEISGPTT--HPKIAQVLLERENPEA 317
F+I+ + F+LLD + + L H P + P + H KI + + + +
Sbjct: 801 AFAISWGQVKLVQGFWLLDHNDYENGLDLLFH--PVTAKPASWQHSKIIEAFMSQGEHKQ 858
Query: 318 ALMVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLTEAFTYQRMLCTKVREKKLKFGTIG 377
AL L+ + S +E + + + + + EA+ R +V
Sbjct: 859 ALRYLQTMK----PTVSSSNEVILHLTVLLFNRCMVEAWNLLRQNSNRVN---------- 904
Query: 378 ETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLHKCLLDSATD 437
+E L+ +C L++ +++LP+ E++ L K L S
Sbjct: 905 ----------------IEELLKHAYEVCQEMGLMEDLLKLPFTNTEQECLVKFLQSST-- 946
Query: 438 DPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSVEQDFISKNPVSEEVLSRMQSQIHWRTK 497
S L+V ++QR Y A ++N I KN + + R++ + R
Sbjct: 947 --SVENHEFLLVHHLQRANYISALKLNQ---------ILKNNLMSDRDPRLRERSVTRNS 995
Query: 498 FIDTSIELLPEVQRQL 513
+D ++LP VQR+L
Sbjct: 996 ILDQYGKILPRVQRKL 1011
>sp|P0C734|BTRF1_EBVG Uncharacterized protein BTRF1 OS=Epstein-Barr virus (strain GD1)
GN=BTRF1 PE=1 SV=1
Length = 404
Score = 35.0 bits (79), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Query: 222 HGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAATFSITRHSLLESLT--FYLLD 279
H S LV + L +L+ +H + N +IV DF A F+ ++ E+ T F L
Sbjct: 242 HNSLRLVPNPDLLPLQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLV 301
Query: 280 DQADEALQEACHLLPEISGPTTHPKIAQVLLERENPEAALMVLRW--SGRDGGSLLVSLS 337
+QA ++ L I +T P V+ + NP +L + ++ G+++ ++
Sbjct: 302 EQAKNTVENIVFCLNSICSTSTLP---DVVPDVNNPNISLALEKYFLMFPPSGTIMRNVR 358
Query: 338 EAVTAVRIRVECALLTEAFTYQRMLCTKVREKK 370
A VR+ + A E T + L ++ KK
Sbjct: 359 FATPIVRLLCQGA---ELGTMAQFLGKYIKVKK 388
>sp|P30119|BTRF1_EBVB9 Uncharacterized protein BTRF1 OS=Epstein-Barr virus (strain B95-8)
GN=BTRF1 PE=3 SV=2
Length = 404
Score = 35.0 bits (79), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Query: 222 HGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAATFSITRHSLLESLT--FYLLD 279
H S LV + L +L+ +H + N +IV DF A F+ ++ E+ T F L
Sbjct: 242 HNSLRLVPNPDLLPLQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLV 301
Query: 280 DQADEALQEACHLLPEISGPTTHPKIAQVLLERENPEAALMVLRW--SGRDGGSLLVSLS 337
+QA ++ L I +T P V+ + NP +L + ++ G+++ ++
Sbjct: 302 EQAKNTVENIVFCLNSICSTSTLP---DVVPDVNNPNISLALEKYFLMFPPSGTIMRNVR 358
Query: 338 EAVTAVRIRVECALLTEAFTYQRMLCTKVREKK 370
A VR+ + A E T + L ++ KK
Sbjct: 359 FATPIVRLLCQGA---ELGTMAQFLGKYIKVKK 388
>sp|Q1HVD3|BTRF1_EBVA8 Uncharacterized protein BTRF1 OS=Epstein-Barr virus (strain AG876)
GN=BTRF1 PE=3 SV=1
Length = 404
Score = 34.7 bits (78), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)
Query: 222 HGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAATFSITRHSLLESLT--FYLLD 279
H S LV + L +L+ +H + N +IV DF A F+ ++ E+ T F L
Sbjct: 242 HNSLRLVPNPDLLPLQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLV 301
Query: 280 DQADEALQEACHLLPEISGPTTHPKIAQVLLERENPEAALMVLRW--SGRDGGSLLVSLS 337
+QA ++ L I +T P V+ + NP +L + ++ G+++ ++
Sbjct: 302 EQAKNTVENIVFCLNSICSTSTLP---DVVPDVNNPNISLALEKYFLMFPPSGTIMRNVR 358
Query: 338 EAVTAVRIRVECALLTEAFTYQRMLCTKVREKK 370
A VR+ + A E T + L ++ KK
Sbjct: 359 FATPIVRLLCQGA---ELGTMAQFLGKYIKVKK 388
>sp|Q28295|VWF_CANFA von Willebrand factor OS=Canis familiaris GN=VWF PE=1 SV=2
Length = 2813
Score = 33.1 bits (74), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 15/36 (41%), Positives = 20/36 (55%)
Query: 278 LDDQADEALQEACHLLPEISGPTTHPKIAQVLLERE 313
L+ + DE + C L PE PT HP +AQV + E
Sbjct: 1446 LEQRRDEIINYLCDLAPEAPAPTQHPPMAQVTVGSE 1481
>sp|P38035|RTCR_ECOLI Transcriptional regulatory protein RtcR OS=Escherichia coli (strain
K12) GN=rtcR PE=4 SV=2
Length = 532
Score = 33.1 bits (74), Expect = 8.1, Method: Compositional matrix adjust.
Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 5/78 (6%)
Query: 369 KKLKFGTIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLH 428
K + FG +G D G + W +W L CLC + +LV +EL +
Sbjct: 3 KTVAFGFVGTVLDYAGRGSQRWSKWRPTL-----CLCQQESLVIDRLELLHDARSRSLFE 57
Query: 429 KCLLDSATDDPSTTVGSL 446
D A+ P T V S+
Sbjct: 58 TLKRDIASVSPETEVVSV 75
>sp|P21449|MDR2_CRIGR Multidrug resistance protein 2 OS=Cricetulus griseus GN=PGY2 PE=2
SV=2
Length = 1276
Score = 33.1 bits (74), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 5/67 (7%)
Query: 583 SIKSPHFEVGHYGPSILHERLFMNKEGSTYDFG--VSKEFKVDGFSTPGVCQSSPMNQTP 640
SI F +GH P+I +F N G+ Y+ + E +D FST G S M
Sbjct: 336 SILFGTFSIGHIAPNI---EVFANARGAAYEIFKIIDNEPSIDSFSTQGHKPDSVMGNLE 392
Query: 641 LKGRNFS 647
K +FS
Sbjct: 393 FKNVHFS 399
>sp|O39822|NSP1_ROTEL Non-structural protein 1 OS=Rotavirus A (isolate Equine/United
Kingdom/L338/1988
G13-P12[18]-I6-Rx-Cx-M6-A6-Nx-Tx-Ex-Hx) PE=3 SV=1
Length = 492
Score = 33.1 bits (74), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 16/105 (15%)
Query: 431 LLDSATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSV------EQDFISKNPVSEEV 484
LLDS D + + S L Y Q+Y + + ++KL+ V +Q+ I+K +
Sbjct: 185 LLDSVNFDRMSALPSDLQALYAQKY-FKISRLPSMKLRQVYYSDFTKQNLITKYRTKTRI 243
Query: 485 LSRMQSQIHWRTKFIDTSIELLPEVQRQLLKNGKLPLNALNSSEE 529
R S+I+W DT IEL L+ N L AL ++EE
Sbjct: 244 THRNVSKINW-----DTDIEL----HNDLMHNKHRILTALTTAEE 279
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.132 0.386
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 294,728,746
Number of Sequences: 539616
Number of extensions: 12518706
Number of successful extensions: 30923
Number of sequences better than 100.0: 35
Number of HSP's better than 100.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 30891
Number of HSP's gapped (non-prelim): 53
length of query: 801
length of database: 191,569,459
effective HSP length: 126
effective length of query: 675
effective length of database: 123,577,843
effective search space: 83415044025
effective search space used: 83415044025
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)