BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 012466
         (463 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q68CP9|ARID2_HUMAN AT-rich interactive domain-containing protein 2 OS=Homo sapiens
           GN=ARID2 PE=1 SV=2
          Length = 1835

 Score = 57.4 bits (137), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 84/333 (25%), Positives = 143/333 (42%), Gaps = 48/333 (14%)

Query: 64  NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQVIDDWR 123
           ++ ++VL+L SGL +E+ +A+N  TLLS + K  M+ +  P  KI  LL A   V DD  
Sbjct: 157 DYNKLVLSLLSGLPNEVDFAINVCTLLSNESKHVMQLEKDP--KIITLLLANAGVFDD-- 212

Query: 124 DIALPKELSKGPRARTLGVNSLVTGFGSEFEALGSINNAFPRSGVGSGSSAADSLVQKNA 183
                          TLG  S  T FG E++     +       +   +   D +  +N 
Sbjct: 213 ---------------TLG--SFSTVFGEEWKEKTDRDFVKFWKDIVDDNEVRDLISDRNK 255

Query: 184 ARVRSSEWW-----FDEDGLFNLDDEGRAEKQQCAVGASNIIRNFSFMPDNEVIMAQHRH 238
           +   +S  W     F       ++D       Q AV    I+RN SF   N  ++A +R 
Sbjct: 256 SHEGTSGEWIWESLFHPPRKLGINDIEGQRVLQIAV----ILRNLSFEEGNVKLLAANRT 311

Query: 239 CLETVFQCIEDHVTEDEELVTNALETIVNLAP--LLDLRIFSSSKQSYIKITREKRAVEA 296
           CL  +      H     +L    L+T+ N+A   LLD   F ++   +  +T+   + + 
Sbjct: 312 CLRFLLLSAHSHFISLRQL---GLDTLGNIAAELLLDPVDFKTTHLMFHTVTKCLMSRDR 368

Query: 297 IMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQ-IHKRLVDLMSLPAFDAQAAA 355
            + + G           E+LG L    DN   +  +V Q  ++ ++  ++LP      + 
Sbjct: 369 FLKMRG----------MEILGNLCKAEDNGVLICEYVDQDSYREIICHLTLPDVLLVIST 418

Query: 356 VGALYNLAEV-NVDCRLKLASERWAIDRLLRVI 387
           +  LY L E+ +V C  K+A    +ID L+ ++
Sbjct: 419 LEVLYMLTEMGDVAC-TKIAKVEKSIDMLVCLV 450


>sp|O14497|ARI1A_HUMAN AT-rich interactive domain-containing protein 1A OS=Homo sapiens
            GN=ARID1A PE=1 SV=3
          Length = 2285

 Score = 40.0 bits (92), Expect = 0.039,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)

Query: 67   RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
            R++++L+SGL +E TWAL+T+ +L + +   M  +   L+++PGLL+ L++
Sbjct: 1671 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1718


>sp|A2BH40|ARI1A_MOUSE AT-rich interactive domain-containing protein 1A OS=Mus musculus
            GN=Arid1a PE=1 SV=1
          Length = 2283

 Score = 40.0 bits (92), Expect = 0.042,   Method: Composition-based stats.
 Identities = 19/51 (37%), Positives = 37/51 (72%), Gaps = 3/51 (5%)

Query: 67   RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
            R++++L+SGL +E TWAL+T+ +L + +   M  +   L+++PGLL+ L++
Sbjct: 1672 RVMMSLKSGLLAESTWALDTINILLYDDNSIMTFN---LSQLPGLLELLVE 1719


>sp|Q8NFD5|ARI1B_HUMAN AT-rich interactive domain-containing protein 1B OS=Homo sapiens
            GN=ARID1B PE=1 SV=2
          Length = 2236

 Score = 35.8 bits (81), Expect = 0.71,   Method: Composition-based stats.
 Identities = 18/51 (35%), Positives = 34/51 (66%), Gaps = 3/51 (5%)

Query: 67   RIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALLQ 117
            R++++L+SGL +E TWAL+T+ +L +   DD       L+++ G L+ L++
Sbjct: 1625 RVMMSLKSGLLAESTWALDTINILLY---DDSTVATFNLSQLSGFLELLVE 1672


>sp|O00507|USP9Y_HUMAN Probable ubiquitin carboxyl-terminal hydrolase FAF-Y OS=Homo sapiens
            GN=USP9Y PE=1 SV=2
          Length = 2555

 Score = 35.4 bits (80), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 45/188 (23%), Positives = 75/188 (39%), Gaps = 44/188 (23%)

Query: 63   QNHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATP---LAKIPGLLDALLQVI 119
            +NH+ ++ +L+  +K +L    N        EK D + D      + K+P +L   L+  
Sbjct: 1747 RNHQNLLDSLEQYIKGDLLEGANAYHC----EKCDKKVDTVKRLLIKKLPRVLAIQLKRF 1802

Query: 120  D-DW-RDIAL--------PKELSKGP----------RARTLGVNSLV---------TGFG 150
            D DW R+ A+        P+EL  GP          R      N L+         T  G
Sbjct: 1803 DYDWERECAIKFNDYFEFPRELDMGPYTVAGVANLERDNVNSENELIEQKEQSDNETAGG 1862

Query: 151  SEFEALGSINNAFPRSGVGSGSSAADSLVQKNAARVRSSEWWFDEDGLFN---LDDEGRA 207
            +++  +G        SG  SG      ++Q+N    ++  W+  +DG      +DD+   
Sbjct: 1863 TKYRLVG----VLVHSGQASGGHYYSYIIQRNGKDDQTDHWYKFDDGDVTECKMDDDEEM 1918

Query: 208  EKQQCAVG 215
             K QC  G
Sbjct: 1919 -KNQCFGG 1925


>sp|Q8NCT3|K0895_HUMAN Uncharacterized protein KIAA0895 OS=Homo sapiens GN=KIAA0895 PE=2
           SV=4
          Length = 520

 Score = 35.0 bits (79), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 28/107 (26%), Positives = 53/107 (49%), Gaps = 9/107 (8%)

Query: 208 EKQQCAVGASNIIRNFSFMPDNEV---IMAQHRHCLETVFQCIEDHVTEDEELVTNALET 264
           EK++     S+   N  F   N     ++A+H H  +   + I  H+TED  L++ A  T
Sbjct: 248 EKEKLRFFKSDYTYNPQFEYANPALPSVLAKHSHASDRFLKQIVVHLTED--LLSRASMT 305

Query: 265 IVNLAPLLDLRIFSSSKQSYIK-ITREKRAVEAIMGI--LGSPFKAW 308
           +VN  P L + + S++++ +++ + R +       GI  L  P+ +W
Sbjct: 306 VVNGCPTLTINV-STAREHWLEGMLRHEIGTHYFRGINNLQQPWNSW 351


>sp|D4D4B1|VPS10_TRIVH Vacuolar protein sorting/targeting protein 10 OS=Trichophyton
           verrucosum (strain HKI 0517) GN=VPS10 PE=3 SV=1
          Length = 1486

 Score = 34.7 bits (78), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)

Query: 242 TVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGIL 301
           T F    D V ED++++T+  E +  +  +  + I  ++++     TREK+ V  I    
Sbjct: 323 TSFTMNLDGVNEDKDMITD-FEQVSGIQGIFLVNIVENAEEVKKGATREKKLVSRISFDD 381

Query: 302 GSPFKAWHCAAAELLGRLIINPDN 325
           G  FK   C   EL    I  P N
Sbjct: 382 GRTFKPLKCGDKELHLHSITRPSN 405


>sp|D4AQV3|VPS10_ARTBC Vacuolar protein sorting/targeting protein 10 OS=Arthroderma
           benhamiae (strain ATCC MYA-4681 / CBS 112371) GN=VPS10
           PE=3 SV=1
          Length = 1486

 Score = 34.7 bits (78), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 38/84 (45%), Gaps = 1/84 (1%)

Query: 242 TVFQCIEDHVTEDEELVTNALETIVNLAPLLDLRIFSSSKQSYIKITREKRAVEAIMGIL 301
           T F    D V ED++++T+  E +  +  +  + I  ++++     TREK+ V  I    
Sbjct: 323 TSFTMNLDGVNEDKDMITD-FEQVSGIQGIFLVNIVENAEEVKKGATREKKLVSRISFDD 381

Query: 302 GSPFKAWHCAAAELLGRLIINPDN 325
           G  FK   C   EL    I  P N
Sbjct: 382 GRTFKPLKCGDKELHLHSITRPSN 405


>sp|O74365|SOL1_SCHPO SWI/SNF chromatin-remodeling complex subunit sol1
           OS=Schizosaccharomyces pombe (strain 972 / ATCC 24843)
           GN=sol1 PE=1 SV=1
          Length = 865

 Score = 33.9 bits (76), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 48/92 (52%), Gaps = 15/92 (16%)

Query: 64  NHKRIVLALQSGLKSELTWALNTLTLLSFKEKDDMRKDATPLAKIPGLLDALL----QVI 119
           N + + ++LQS L+ E+T+A+N L +L+     + +K   PL++   ++DAL+    Q +
Sbjct: 429 NIQALCMSLQSTLEKEITYAMNVLLILT-----NDQKWMFPLSECQDVVDALIDVATQCL 483

Query: 120 DDWRDIALPKE-----LSKGPRARTLGVNSLV 146
           D+   + LP E       K P  R L  N  V
Sbjct: 484 DNLLSV-LPNEDLMEIADKRPSYRQLLYNCCV 514


>sp|Q4AAR2|KCY_MYCHJ Cytidylate kinase OS=Mycoplasma hyopneumoniae (strain J / ATCC
           25934 / NCTC 10110) GN=cmk PE=3 SV=1
          Length = 229

 Score = 33.9 bits (76), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 52/123 (42%), Gaps = 7/123 (5%)

Query: 159 INNAFPRSGVGSGS--SAADSLVQKNAARVRSSEWWFDE--DGLFNLDDEGRAEKQQCAV 214
           I N F    + +GS   A     QKN   + S             +LD EG    Q   V
Sbjct: 25  IANKFNYLFINTGSLYRAIAFFCQKNQISITSERKMIKHLPPNFLSLDFEGNVWLQNQNV 84

Query: 215 GASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDL 274
             SN++RN   +  N  I+AQ+    + V + +++     + ++    +T  N+ P  DL
Sbjct: 85  --SNLLRN-DLISKNAAIIAQYPQIRKIVTEILQNFQKNHKGIIMEGRDTTYNVMPDADL 141

Query: 275 RIF 277
           +IF
Sbjct: 142 KIF 144


>sp|Q4A8U3|KCY_MYCH7 Cytidylate kinase OS=Mycoplasma hyopneumoniae (strain 7448) GN=cmk
           PE=3 SV=1
          Length = 229

 Score = 33.9 bits (76), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 52/123 (42%), Gaps = 7/123 (5%)

Query: 159 INNAFPRSGVGSGS--SAADSLVQKNAARVRSSEWWFDE--DGLFNLDDEGRAEKQQCAV 214
           I N F    + +GS   A     QKN   + S             +LD EG    Q   V
Sbjct: 25  IANKFNYLFINTGSLYRAIAFFCQKNQISITSERKMIKHLPPNFLSLDFEGNVWLQNQNV 84

Query: 215 GASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDL 274
             SN++RN   +  N  I+AQ+    + V + +++     + ++    +T  N+ P  DL
Sbjct: 85  --SNLLRN-DLISKNAAIIAQYPQIRKIVTEILQNFQKNHKGIIMEGRDTTYNVMPDADL 141

Query: 275 RIF 277
           +IF
Sbjct: 142 KIF 144


>sp|Q32ES6|MDTH_SHIDS Multidrug resistance protein MdtH OS=Shigella dysenteriae serotype
           1 (strain Sd197) GN=mdtH PE=3 SV=2
          Length = 401

 Score = 33.1 bits (74), Expect = 4.3,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 34/80 (42%), Gaps = 2/80 (2%)

Query: 296 AIMGILGSPFKAWHCAAAELLGRLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAA 355
           A MGI   P+  W       LG  + +P     ++  +PQ   R   L+ +   D+  A 
Sbjct: 91  ATMGIAHEPWLLWFSCLLSGLGGTLFDPPRSALVVKLMPQQRGRFFSLLMMQ--DSAGAV 148

Query: 356 VGALYNLAEVNVDCRLKLAS 375
           +GAL     +  D RL  A+
Sbjct: 149 IGALLGSWLLQYDFRLVCAT 168


>sp|Q601X7|KCY_MYCH2 Cytidylate kinase OS=Mycoplasma hyopneumoniae (strain 232) GN=cmk
           PE=3 SV=1
          Length = 229

 Score = 33.1 bits (74), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 32/123 (26%), Positives = 51/123 (41%), Gaps = 7/123 (5%)

Query: 159 INNAFPRSGVGSGS--SAADSLVQKNAARVRSSEWWFDE--DGLFNLDDEGRAEKQQCAV 214
           I N F    + +GS   A     QKN   + S             +LD EG    Q   V
Sbjct: 25  IANKFNYLFINTGSLYRAIAFFCQKNQISITSERKMIKHLPPNFLSLDFEGNVWLQNQNV 84

Query: 215 GASNIIRNFSFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETIVNLAPLLDL 274
             SN++RN   +  N  I+AQ+    + V + ++      + ++    +T  N+ P  DL
Sbjct: 85  --SNLLRN-DLISKNAAIIAQYPQIRKIVTEILQSFQKNHKGIIMEGRDTTYNVMPDADL 141

Query: 275 RIF 277
           +IF
Sbjct: 142 KIF 144


>sp|Q5T2S8|ARMC4_HUMAN Armadillo repeat-containing protein 4 OS=Homo sapiens GN=ARMC4 PE=2
           SV=1
          Length = 1044

 Score = 33.1 bits (74), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 17/107 (15%)

Query: 208 EKQQCAVGASNIIRNF--SFMPDNEVIMAQHRHCLETVFQCIEDHVTEDEELVTNALETI 265
           E  + A+ A  II N   +   +NE +     HC   ++QC ED  T D   +   L+ +
Sbjct: 663 ENYRAAIKAERIIENLVKNLNSENEQL---QEHCAMAIYQCAEDKETRDLVRLHGGLKPL 719

Query: 266 VNLAPLLDLR-----------IFSSSKQSYIKITREKRAVEAIMGIL 301
            +L    D +             S SK++  K  RE +A+E ++G+L
Sbjct: 720 ASLLNNTDNKERLAAVTGAIWKCSISKENVTKF-REYKAIETLVGLL 765


>sp|O43028|VAC8_SCHPO Vacuolar protein 8 OS=Schizosaccharomyces pombe (strain 972 / ATCC
           24843) GN=vac8 PE=1 SV=4
          Length = 550

 Score = 33.1 bits (74), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 29/109 (26%), Positives = 53/109 (48%), Gaps = 10/109 (9%)

Query: 264 TIVNLAPLLDLRIFSSSKQSYIKITR------EKRAVEAIMGILGSPFKAWHCAAAELLG 317
           TI+  +  LDL+   S+  ++ +IT       ++  +E ++ +L SP      AA+  LG
Sbjct: 53  TILAYSDNLDLQ--RSAALAFAEITEKDVREVDRETIEPVLFLLQSPDAEIQRAASVALG 110

Query: 318 RLIINPDNEPFLLPFVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVN 366
            L +N +N+  ++         L+  M  P  + Q  AVG + NLA ++
Sbjct: 111 NLAVNAENKALVVKL--NGLDLLIRQMMSPHVEVQCNAVGCITNLATLD 157


>sp|P25043|PSB2_YEAST Proteasome subunit beta type-2 OS=Saccharomyces cerevisiae (strain
           ATCC 204508 / S288c) GN=PUP1 PE=1 SV=1
          Length = 261

 Score = 32.7 bits (73), Expect = 5.6,   Method: Compositional matrix adjust.
 Identities = 16/38 (42%), Positives = 27/38 (71%), Gaps = 1/38 (2%)

Query: 36  GSGSAADSAAPTTLLGPSLQVHSSFADQNHKRIVLALQ 73
           G+G+AAD+ A T L+G ++++HS +  +   R+V ALQ
Sbjct: 74  GAGTAADTEAVTQLIGSNIELHSLYTSRE-PRVVSALQ 110


>sp|Q8IN94|OSA_DROME Trithorax group protein osa OS=Drosophila melanogaster GN=osa PE=1
            SV=1
          Length = 2716

 Score = 32.7 bits (73), Expect = 6.0,   Method: Composition-based stats.
 Identities = 14/28 (50%), Positives = 21/28 (75%)

Query: 67   RIVLALQSGLKSELTWALNTLTLLSFKE 94
            RI +A++SGL +E TWAL+ L +L F +
Sbjct: 1806 RIFMAMRSGLLTECTWALDVLNVLLFDD 1833


>sp|Q54RZ7|Y1199_DICDI Probable serine/threonine-protein kinase DDB_G0282895
            OS=Dictyostelium discoideum GN=DDB_G0282895 PE=3 SV=1
          Length = 1634

 Score = 32.0 bits (71), Expect = 9.5,   Method: Composition-based stats.
 Identities = 30/137 (21%), Positives = 56/137 (40%), Gaps = 29/137 (21%)

Query: 332  FVPQIHKRLVDLMSLPAFDAQAAAVGALYNLAEVNVDCRLKLASERWAIDRLLRVIKTPH 391
            F+  + K +VD  +  A   +A A   L NL+  +++C L++ S+               
Sbjct: 1307 FIDIMEKPIVDATNKNAEKTRAQAAQILINLSISSIECLLEVKSK-----------NALG 1355

Query: 392  PVPEVCRKAAMILESLVSEPQNRVLLLAYENAFAEI-------------LFSDGRYSDTF 438
            P+ ++C+       + + E + ++L    E A AE+             +F++G +S   
Sbjct: 1356 PILDICKFGQAFAHTQIEESELQILQFLGEGALAEVHKGIWKGKEVAVKIFNEGSFSFRL 1415

Query: 439  ARILYE-----LTSRPN 450
               L E     L S PN
Sbjct: 1416 EDFLKEVAILGLISHPN 1432


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.318    0.133    0.390 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 166,673,816
Number of Sequences: 539616
Number of extensions: 6865393
Number of successful extensions: 30588
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 24
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 30042
Number of HSP's gapped (non-prelim): 542
length of query: 463
length of database: 191,569,459
effective HSP length: 121
effective length of query: 342
effective length of database: 126,275,923
effective search space: 43186365666
effective search space used: 43186365666
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 63 (28.9 bits)