BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 003705
         (801 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q84JU6|HOS1_ARATH E3 ubiquitin-protein ligase HOS1 OS=Arabidopsis thaliana GN=HOS1
           PE=1 SV=1
          Length = 927

 Score =  749 bits (1935), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 402/793 (50%), Positives = 528/793 (66%), Gaps = 53/793 (6%)

Query: 1   MDEAAVSSDPVVAFLLDEVVVKDWCKRAFKNIIAELKLIYNLEVEVMKTRLSLLLKFQMK 60
           MDE AVSSDPV+AFLLDEVVVKDW KR F++ +AEL+ IYNLE + M+  L  LL+   +
Sbjct: 167 MDETAVSSDPVIAFLLDEVVVKDWVKRTFRSTLAELQEIYNLETKEMQAWLDKLLRCSKQ 226

Query: 61  LRDISSVIEVLASSFKDDLSAQVHDLHHFQESILKTKQHLEIMMWCAKQQFLENVRSRHA 120
           +  I SV+EV+ S+FK  +S Q+ D+   +E+I KTKQHL+IM+WC +  FL++VRSR++
Sbjct: 227 VAGICSVLEVMESAFKGSVSPQLQDVQTLRENIGKTKQHLDIMVWCIRHGFLDDVRSRYS 286

Query: 121 SFTSWHSLVRQRKSAATERAWYDPVNYSAESTKQDGSLFIEDALANLEIEQEFTQGRGEE 180
           +FTSW++LV +RKS A +RAW D V+ S++ + Q  SLFIEDAL NLE E E++Q  G +
Sbjct: 287 NFTSWNALVGERKSNAVKRAWPDAVDQSSDCSVQSASLFIEDALENLEREPEYSQEIGAD 346

Query: 181 LDITSLHKDDEGSSFVRSKIEGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLF 240
           L++  L KD    SF+RSKIEG SG YPFENLR A D+LFLHG SDLV+AKQAIFLYYLF
Sbjct: 347 LEVGRLQKDKR--SFLRSKIEGTSGSYPFENLRTAADMLFLHGGSDLVVAKQAIFLYYLF 404

Query: 241 DRHWTMPDENWRHIVDDFAATFSITRHSLLESLTFYLLDDQADEALQEACHLLPEISGPT 300
           DRHWT P++ W+H +DDFAATF ITRHSLLES  FYLLDD ++EALQEAC +LPEI GP 
Sbjct: 405 DRHWTTPEKYWKHTIDDFAATFGITRHSLLESFVFYLLDDHSEEALQEACRILPEICGPE 464

Query: 301 THPKIAQVLLERENPEAALMVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLTEAFTYQR 360
           T+PK+AQVLLER+NPE ALMVLRWSGRDG S LVS+ EAVTA+R+RVEC LL+EAFTYQR
Sbjct: 465 TYPKVAQVLLERDNPETALMVLRWSGRDGVSELVSIGEAVTALRVRVECGLLSEAFTYQR 524

Query: 361 MLCTKVREKKLKFGTIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWN 420
            LC KV+E  LK G +    DDL     +W +W+E+LV E CCL IRRNLVDR+IELPWN
Sbjct: 525 TLCLKVKENNLKNGAVKHASDDLD--IWSWTEWMEILVNEFCCLSIRRNLVDRIIELPWN 582

Query: 421 TDEEKYLHKCLLDSATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSVEQDFISKNPV 480
            DEEKYLH+CLLDSATDDPS+ VGSLLVVFY+QRYRY +AYQV+L+LQ +E+ F+S N +
Sbjct: 583 PDEEKYLHRCLLDSATDDPSSAVGSLLVVFYLQRYRYIQAYQVDLRLQKIEEAFVSDNQI 642

Query: 481 SEEVLSRMQSQIHWRTKFIDTSIELLPEVQRQLLKNGKLPLNALNSSEEVEIPEKSDLHG 540
            EEV+ RM+SQ HWR + +D +I++LP +Q+Q +++G+         +  E  +KSDL  
Sbjct: 643 GEEVMFRMRSQSHWRKELVDRAIDILPVIQQQQVRSGQFS----EMEDASEGAKKSDLPD 698

Query: 541 SQELKSITLLIPTTADSSLLLPTSNLTPANSSVFE--SPTGPGRSIKSPHFEVGHYGPSI 598
           + ++  IT  +P    +S+ L ++N   A   V    SP  PG  I       G+    +
Sbjct: 699 APDM--ITSSVPFATTNSVFLQSANNARAREPVANNGSPFQPGHMI-------GNASHDL 749

Query: 599 LHERLFMN-KEGSTYDF-GVSKEFKVDGFSTPGVCQSSPMNQTPLKGRNFSSRTLSNSHR 656
            H RLF N   G   +   V+K  K    S            TP K  N   R   NS  
Sbjct: 750 SHGRLFTNANRGQKSEVRSVTKNLKFGEMS------------TPFKDLN---RARGNSQL 794

Query: 657 RDKVSDKISPVPEQNGFLSQHLNTIHHYSHRMTTNPASTPVSNRGLHNDLAGDLH---SN 713
           + K +++ SP    + ++  ++++   Y  R+T N    PV+ +   N L G      S 
Sbjct: 795 QGKRTEESSPEVNVDRYIENNMSS--PYLRRITAN---NPVTVKSSSNHLNGSSQKPEST 849

Query: 714 LSSKRVHSDREDGLRYMISSEDPMDVSLS---NGKKGFAVEDRQAIAGGGLRWRSDETSD 770
               R+  D+++     +  +DPMD+S S   N     A E R     GGLRWRSDETSD
Sbjct: 850 FFGTRMQPDKDN----FVDLDDPMDMSSSLKDNNNNVLATESRN--NSGGLRWRSDETSD 903

Query: 771 EEEKQSPESAMGV 783
           +E++ +   +M V
Sbjct: 904 DEDELTSFGSMPV 916


>sp|Q8WYP5|ELYS_HUMAN Protein ELYS OS=Homo sapiens GN=AHCTF1 PE=1 SV=3
          Length = 2266

 Score = 62.0 bits (149), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 93/416 (22%), Positives = 175/416 (42%), Gaps = 70/416 (16%)

Query: 201  EGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAA 260
            EG +G YP  +L A +D+  L G ++   AK +I +Y L D  ++ P++    I + F  
Sbjct: 744  EGGTGKYPPASLHAVLDMYLLDGVTEA--AKHSITIYLLLDIMYSFPNKTDTPI-ESFPT 800

Query: 261  TFSITRHSLLESLTFYLLD-DQADEALQEACHLLPEISGPTT--HPKIAQVLLERENPEA 317
             F+I+   +     F+L+D +  +  L    H  P  + P +  H KI Q  + +     
Sbjct: 801  VFAISWGQVKLIQGFWLIDHNDYESGLDLLFH--PATAKPLSWQHSKIIQAFMSQGEHRQ 858

Query: 318  AL---MVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLTEAFTYQRMLCTKVREKKLKFG 374
            AL     ++ +   G  +++ L+     V +   C  + EA+ + R  C ++        
Sbjct: 859  ALRYIQTMKPTVSSGNDVILHLT-----VLLFNRC--MVEAWNFLRQHCNRLN------- 904

Query: 375  TIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLHKCLLDS 434
                               +E L+  +  +C    L++ +++LP+   E++ L K L  S
Sbjct: 905  -------------------IEELLKHMYEVCQEMGLMEDLLKLPFTDTEQECLVKFLQSS 945

Query: 435  ATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQ-SVEQDFISKNPVSEEVLSRMQSQIH 493
            A    S      L+V ++QR  Y  A ++N  L+ +V  D   ++P       R++ +  
Sbjct: 946  A----SVQNHEFLLVHHLQRANYVPALKLNQTLKINVMND---RDP-------RLRERSL 991

Query: 494  WRTKFIDTSIELLPEVQRQLLKNGKLPLNALNSSEEVEIPEKSDLHG-SQELKSITLLIP 552
             R   +D   ++LP V R+L      P +   SS    +     L    +++ + T+L  
Sbjct: 992  ARNSILDQYGKILPRVHRKLAIERAKPYHLSTSSVFRLVSRPKPLSAVPKQVVTGTVLTR 1051

Query: 553  TTADSSLLL----------PTSNLTPANSSVFESPTGPGRSIKSPHFEVGHYGPSI 598
            +   +++L           P ++ TP NSS  E P+    S+ +P      +G  I
Sbjct: 1052 SVFINNVLSKIGEVWASKEPINSTTPFNSSKIEEPSPIVYSLPAPELPEAFFGTPI 1107


>sp|Q5U249|ELYS_XENLA Protein ELYS OS=Xenopus laevis GN=ahctf1 PE=1 SV=1
          Length = 2408

 Score = 60.8 bits (146), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 75/324 (23%), Positives = 139/324 (42%), Gaps = 62/324 (19%)

Query: 198  SKIEGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDD 257
            S+ +  +G YP  NL A +D+  L  + ++  +K AI +Y+L D  ++ PD+    I + 
Sbjct: 739  SRDDNGTGKYPPANLHALLDVYLLENADEM--SKHAITIYFLLDIMYSFPDKPDSSI-ES 795

Query: 258  FAATFSITRHSLLESLTFYLLDDQADEALQEACHLLPEISGPTT--HPKIAQVLLERENP 315
            F   F +   SL++ +  + L D  D      C L P  S   +  H +I + LL   + 
Sbjct: 796  FPTAFFVP-GSLIKLIQGFWLLDHNDYQNSVDCILNPASSRVMSWQHSQIIENLLCHGDS 854

Query: 316  EAALMVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLT------EAFTYQRMLCTKVREK 369
              AL  L+          V    A T+  +++   +L       EA+  QR+  +++   
Sbjct: 855  RQALRYLQ----------VMKPVATTSKEVKLHMTVLLANRSILEAWNLQRLHSSRLN-- 902

Query: 370  KLKFGTIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLHK 429
                                    +E L+  +  +C    L++ +++L +   E+ YLHK
Sbjct: 903  ------------------------VEELLKHMYEMCQEMGLIEELLKLTFTDFEQGYLHK 938

Query: 430  CLLDSATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSVEQDFISKNPVSEEVLSRMQ 489
             L  +   +       LL+V ++QR  Y  A Q+N   QS++ + ++      +   R++
Sbjct: 939  FLQTTGVQN-----QELLLVHHLQRANYISALQLN---QSLKTNHLN------DCDRRLR 984

Query: 490  SQIHWRTKFIDTSIELLPEVQRQL 513
             +   R   +D   ++LP VQR L
Sbjct: 985  ERSGARNAILDQYGKILPRVQRTL 1008


>sp|Q8CJF7|ELYS_MOUSE Protein ELYS OS=Mus musculus GN=Ahctf1 PE=1 SV=1
          Length = 2243

 Score = 52.0 bits (123), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/316 (22%), Positives = 131/316 (41%), Gaps = 51/316 (16%)

Query: 201  EGVSGCYPFENLRAAVDILFLHGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAA 260
            EG +G YP  ++ A +DI  L   ++   +K AI +Y L D  ++ P++    I + F  
Sbjct: 744  EGGTGRYPPASIHALLDIYLLDNITEA--SKHAITIYLLLDIMYSFPNKTDTPI-ESFPT 800

Query: 261  TFSITRHSLLESLTFYLLD-DQADEALQEACHLLPEISGPTT--HPKIAQVLLERENPEA 317
             F+I+   +     F+LLD +  +  L    H  P  + P +  H KI +  + +   + 
Sbjct: 801  AFAISWGQVKLVQGFWLLDHNDYENGLDLLFH--PVTAKPASWQHSKIIEAFMSQGEHKQ 858

Query: 318  ALMVLRWSGRDGGSLLVSLSEAVTAVRIRVECALLTEAFTYQRMLCTKVREKKLKFGTIG 377
            AL  L+         + S +E +  + + +    + EA+   R    +V           
Sbjct: 859  ALRYLQTMK----PTVSSSNEVILHLTVLLFNRCMVEAWNLLRQNSNRVN---------- 904

Query: 378  ETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLHKCLLDSATD 437
                            +E L+     +C    L++ +++LP+   E++ L K L  S   
Sbjct: 905  ----------------IEELLKHAYEVCQEMGLMEDLLKLPFTNTEQECLVKFLQSST-- 946

Query: 438  DPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSVEQDFISKNPVSEEVLSRMQSQIHWRTK 497
              S      L+V ++QR  Y  A ++N          I KN +  +   R++ +   R  
Sbjct: 947  --SVENHEFLLVHHLQRANYISALKLNQ---------ILKNNLMSDRDPRLRERSVTRNS 995

Query: 498  FIDTSIELLPEVQRQL 513
             +D   ++LP VQR+L
Sbjct: 996  ILDQYGKILPRVQRKL 1011


>sp|P0C734|BTRF1_EBVG Uncharacterized protein BTRF1 OS=Epstein-Barr virus (strain GD1)
           GN=BTRF1 PE=1 SV=1
          Length = 404

 Score = 35.0 bits (79), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)

Query: 222 HGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAATFSITRHSLLESLT--FYLLD 279
           H S  LV     + L +L+ +H  +   N  +IV DF A F+    ++ E+ T  F  L 
Sbjct: 242 HNSLRLVPNPDLLPLQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLV 301

Query: 280 DQADEALQEACHLLPEISGPTTHPKIAQVLLERENPEAALMVLRW--SGRDGGSLLVSLS 337
           +QA   ++     L  I   +T P    V+ +  NP  +L + ++       G+++ ++ 
Sbjct: 302 EQAKNTVENIVFCLNSICSTSTLP---DVVPDVNNPNISLALEKYFLMFPPSGTIMRNVR 358

Query: 338 EAVTAVRIRVECALLTEAFTYQRMLCTKVREKK 370
            A   VR+  + A   E  T  + L   ++ KK
Sbjct: 359 FATPIVRLLCQGA---ELGTMAQFLGKYIKVKK 388


>sp|P30119|BTRF1_EBVB9 Uncharacterized protein BTRF1 OS=Epstein-Barr virus (strain B95-8)
           GN=BTRF1 PE=3 SV=2
          Length = 404

 Score = 35.0 bits (79), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)

Query: 222 HGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAATFSITRHSLLESLT--FYLLD 279
           H S  LV     + L +L+ +H  +   N  +IV DF A F+    ++ E+ T  F  L 
Sbjct: 242 HNSLRLVPNPDLLPLQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLV 301

Query: 280 DQADEALQEACHLLPEISGPTTHPKIAQVLLERENPEAALMVLRW--SGRDGGSLLVSLS 337
           +QA   ++     L  I   +T P    V+ +  NP  +L + ++       G+++ ++ 
Sbjct: 302 EQAKNTVENIVFCLNSICSTSTLP---DVVPDVNNPNISLALEKYFLMFPPSGTIMRNVR 358

Query: 338 EAVTAVRIRVECALLTEAFTYQRMLCTKVREKK 370
            A   VR+  + A   E  T  + L   ++ KK
Sbjct: 359 FATPIVRLLCQGA---ELGTMAQFLGKYIKVKK 388


>sp|Q1HVD3|BTRF1_EBVA8 Uncharacterized protein BTRF1 OS=Epstein-Barr virus (strain AG876)
           GN=BTRF1 PE=3 SV=1
          Length = 404

 Score = 34.7 bits (78), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 38/153 (24%), Positives = 68/153 (44%), Gaps = 10/153 (6%)

Query: 222 HGSSDLVLAKQAIFLYYLFDRHWTMPDENWRHIVDDFAATFSITRHSLLESLT--FYLLD 279
           H S  LV     + L +L+ +H  +   N  +IV DF A F+    ++ E+ T  F  L 
Sbjct: 242 HNSLRLVPNPDLLPLQHLYLKHVVLKSLNLENIVQDFEAIFTSPSDTISEAETKAFEKLV 301

Query: 280 DQADEALQEACHLLPEISGPTTHPKIAQVLLERENPEAALMVLRW--SGRDGGSLLVSLS 337
           +QA   ++     L  I   +T P    V+ +  NP  +L + ++       G+++ ++ 
Sbjct: 302 EQAKNTVENIVFCLNSICSTSTLP---DVVPDVNNPNISLALEKYFLMFPPSGTIMRNVR 358

Query: 338 EAVTAVRIRVECALLTEAFTYQRMLCTKVREKK 370
            A   VR+  + A   E  T  + L   ++ KK
Sbjct: 359 FATPIVRLLCQGA---ELGTMAQFLGKYIKVKK 388


>sp|Q28295|VWF_CANFA von Willebrand factor OS=Canis familiaris GN=VWF PE=1 SV=2
          Length = 2813

 Score = 33.1 bits (74), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 15/36 (41%), Positives = 20/36 (55%)

Query: 278  LDDQADEALQEACHLLPEISGPTTHPKIAQVLLERE 313
            L+ + DE +   C L PE   PT HP +AQV +  E
Sbjct: 1446 LEQRRDEIINYLCDLAPEAPAPTQHPPMAQVTVGSE 1481


>sp|P38035|RTCR_ECOLI Transcriptional regulatory protein RtcR OS=Escherichia coli (strain
           K12) GN=rtcR PE=4 SV=2
          Length = 532

 Score = 33.1 bits (74), Expect = 8.1,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 5/78 (6%)

Query: 369 KKLKFGTIGETFDDLQGGFKTWEQWLEVLVTEICCLCIRRNLVDRMIELPWNTDEEKYLH 428
           K + FG +G   D    G + W +W   L     CLC + +LV   +EL  +        
Sbjct: 3   KTVAFGFVGTVLDYAGRGSQRWSKWRPTL-----CLCQQESLVIDRLELLHDARSRSLFE 57

Query: 429 KCLLDSATDDPSTTVGSL 446
               D A+  P T V S+
Sbjct: 58  TLKRDIASVSPETEVVSV 75


>sp|P21449|MDR2_CRIGR Multidrug resistance protein 2 OS=Cricetulus griseus GN=PGY2 PE=2
           SV=2
          Length = 1276

 Score = 33.1 bits (74), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 22/67 (32%), Positives = 30/67 (44%), Gaps = 5/67 (7%)

Query: 583 SIKSPHFEVGHYGPSILHERLFMNKEGSTYDFG--VSKEFKVDGFSTPGVCQSSPMNQTP 640
           SI    F +GH  P+I    +F N  G+ Y+    +  E  +D FST G    S M    
Sbjct: 336 SILFGTFSIGHIAPNI---EVFANARGAAYEIFKIIDNEPSIDSFSTQGHKPDSVMGNLE 392

Query: 641 LKGRNFS 647
            K  +FS
Sbjct: 393 FKNVHFS 399


>sp|O39822|NSP1_ROTEL Non-structural protein 1 OS=Rotavirus A (isolate Equine/United
           Kingdom/L338/1988
           G13-P12[18]-I6-Rx-Cx-M6-A6-Nx-Tx-Ex-Hx) PE=3 SV=1
          Length = 492

 Score = 33.1 bits (74), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 16/105 (15%)

Query: 431 LLDSATDDPSTTVGSLLVVFYIQRYRYAEAYQVNLKLQSV------EQDFISKNPVSEEV 484
           LLDS   D  + + S L   Y Q+Y +  +   ++KL+ V      +Q+ I+K      +
Sbjct: 185 LLDSVNFDRMSALPSDLQALYAQKY-FKISRLPSMKLRQVYYSDFTKQNLITKYRTKTRI 243

Query: 485 LSRMQSQIHWRTKFIDTSIELLPEVQRQLLKNGKLPLNALNSSEE 529
             R  S+I+W     DT IEL       L+ N    L AL ++EE
Sbjct: 244 THRNVSKINW-----DTDIEL----HNDLMHNKHRILTALTTAEE 279


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.132    0.386 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 294,728,746
Number of Sequences: 539616
Number of extensions: 12518706
Number of successful extensions: 30923
Number of sequences better than 100.0: 35
Number of HSP's better than 100.0 without gapping: 3
Number of HSP's successfully gapped in prelim test: 32
Number of HSP's that attempted gapping in prelim test: 30891
Number of HSP's gapped (non-prelim): 53
length of query: 801
length of database: 191,569,459
effective HSP length: 126
effective length of query: 675
effective length of database: 123,577,843
effective search space: 83415044025
effective search space used: 83415044025
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (29.6 bits)