BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 017829
         (365 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9SD33|U183_ARATH UPF0183 protein At3g51130 OS=Arabidopsis thaliana GN=At3g51130 PE=2
           SV=2
          Length = 410

 Score =  615 bits (1587), Expect = e-175,   Method: Compositional matrix adjust.
 Identities = 286/354 (80%), Positives = 317/354 (89%), Gaps = 5/354 (1%)

Query: 2   LQSQKPRRRCEGTAMGAIVLDLRPGVGIGPFSLGMPICEAFASIEQQPNIYDVVHVKYFD 61
           L  Q+PRRR EGTAMGA V DLRPGVGIGPFS+GMPICEAFA IEQQPNIYDVVHVKY+D
Sbjct: 11  LVMQRPRRRLEGTAMGATVFDLRPGVGIGPFSIGMPICEAFAQIEQQPNIYDVVHVKYYD 70

Query: 62  EEPLKLDIIISFPDHGFHLRFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAV 121
           E+PLKLD++ISFPDHGFHLRFDPWSQRLRL+EIFD+KRLQMRYATS+IGG STLATFVAV
Sbjct: 71  EDPLKLDVVISFPDHGFHLRFDPWSQRLRLVEIFDVKRLQMRYATSMIGGPSTLATFVAV 130

Query: 122 YALFGPTFPGVYDKERSVYMLFYPGLSFAFPIPAQYADCCQDREAELPLEFPDGTTPVTC 181
           YALFGPTFPG+YDKER +Y LFYPGLSF FPIP QY DCC D EA LPLEFPDGTTPVTC
Sbjct: 131 YALFGPTFPGIYDKERGIYSLFYPGLSFEFPIPNQYTDCCHDGEAALPLEFPDGTTPVTC 190

Query: 182 RVSIYDGSADKKVGVGSLFDKAIAPSLPVGSLYIEEVHAKLGEELHFTVGSQHIPFGASP 241
           RVSIYD S+DKKVGVG L D+A  P LP GSLY+EEVH K G+EL+FTVG QH+PFGASP
Sbjct: 191 RVSIYDNSSDKKVGVGKLMDRASVPPLPPGSLYMEEVHVKPGKELYFTVGGQHMPFGASP 250

Query: 242 QDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFDGQTHKIK 301
           QDVWTELGRPCGIH KQVDQMVIHSASDPRP++T+CGDYFYNY+TRGLDILFDG+THK+K
Sbjct: 251 QDVWTELGRPCGIHPKQVDQMVIHSASDPRPKTTICGDYFYNYFTRGLDILFDGETHKVK 310

Query: 302 KFIMHTNYPGHADFNSYIKCNFII-LGSDFAGTSAEVHSYKNKITPNTKWEQVK 354
           KF++HTNYPGHADFNSYIKCNF+I  G+D    +AE +   NKITP+T W+QVK
Sbjct: 311 KFVLHTNYPGHADFNSYIKCNFVISAGAD----AAEANRSGNKITPSTNWDQVK 360


>sp|Q9VSH9|U183_DROME UPF0183 protein CG7083 OS=Drosophila melanogaster GN=CG7083 PE=2
           SV=1
          Length = 438

 Score =  194 bits (492), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/371 (35%), Positives = 190/371 (51%), Gaps = 52/371 (14%)

Query: 21  LDLRPGVGIG----PFSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDH 76
           L++ P + +G     F LGM   +A A I+ Q  I   V V Y D  PL +DIII+ P  
Sbjct: 4   LEIVPEISLGCDAWEFVLGMHFSQAIAIIQSQVGIIKGVQVLYSDTTPLGVDIIINLPQD 63

Query: 77  GFHLRFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKE 136
           G  L FDP SQRL+ IE+F++K +++RY          L +   +   FG T PGVYD  
Sbjct: 64  GVRLIFDPVSQRLKTIEVFNMKLVKLRYFGVYFNSPEVLPSIEQIEHSFGATHPGVYDAA 123

Query: 137 RSVYMLFYPGLSFAFPIPAQ----YADCCQDREAELPLEFPDGTTPVTCRVSIYDGSADK 192
           + ++ L + GLSF FP+ ++    YA           L F +G +PV  ++S+Y GS   
Sbjct: 124 KQLFALHFRGLSFYFPVDSKLHSGYAHGLSS------LVFLNGASPVVSKMSLYAGS--- 174

Query: 193 KVGVGSLFDKAIAPSLPVG----SLYIEEV--------HAKLGEELHFTVGS-------- 232
                ++ +  + PSLP+      +Y+E          H K  +   FT GS        
Sbjct: 175 -----NVLENRV-PSLPLSCYHRQMYLESATVLRTAFGHTKGLKLKLFTEGSGRALEPRR 228

Query: 233 ----QHIPFGASPQDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRG 288
               + + FG S +DV T LG P  I  K  D+M IHS+S  R   +   D F+NY+T G
Sbjct: 229 QCFTRELLFGDSCEDVATSLGAPNRIFFKSEDKMKIHSSSVNRQAQSKRSDIFFNYFTLG 288

Query: 289 LDILFDGQTHKIKKFIMHTNYPGHADFNSYIKCNF-IILGSDFAGTSAEVHSYKNKITPN 347
           +D+LFD +T   KKFI+HTNYPGH +FN Y +C F  +L +D    S   H   + +TP 
Sbjct: 289 IDVLFDARTQTCKKFILHTNYPGHFNFNMYHRCEFQFLLQADHPSMSDSGH---DLVTP- 344

Query: 348 TKWEQVKVKSF 358
           TK E V + ++
Sbjct: 345 TKQEHVNITAY 355


>sp|O08654|CP070_RAT UPF0183 protein C16orf70 homolog OS=Rattus norvegicus PE=2 SV=1
          Length = 422

 Score =  177 bits (449), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 118/363 (32%), Positives = 179/363 (49%), Gaps = 66/363 (18%)

Query: 32  FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRL 91
           F+LGMP+ +A A +++   I   V V Y ++ PL  D+I++    G  L FD ++QRL++
Sbjct: 19  FTLGMPLAQAVAILQKHCRIIKNVQVLYSEQSPLSHDLILNLTQDGIKLLFDAFNQRLKV 78

Query: 92  IEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAF 151
           IE++D+ +++++Y        +   T   +   FG T PGVY+    ++ L + GLSF+F
Sbjct: 79  IEVYDLTKVKLKYCGVHFNSQAIAPTIEQIDQSFGATHPGVYNSAEQLFHLNFRGLSFSF 138

Query: 152 PIPAQYADCCQDREAELP------------LEFPDGTTPVTCRVSIYDGSADKKVGVGSL 199
            +         D   E P            L+ P G T    R+ IY G+        SL
Sbjct: 139 QL---------DSWTEAPKYEPNFAHGLASLQIPHGAT--VKRMYIYSGN--------SL 179

Query: 200 FDKAIAPSLPV----GSLYIEEVH-----------------AKLG----EELHFTVGSQH 234
            D   AP +P+    G++Y E V                  A  G     +    V  + 
Sbjct: 180 QDTK-APMMPLSCFLGNVYAESVDVIRDGTGPSGLRLRLLAAGCGPGVLADAKMRVFERA 238

Query: 235 IPFGASPQDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFD 294
           + FG S QDV + LG P  +  K  D+M IHS S  +   + C DYF+NY+T G+DILFD
Sbjct: 239 VYFGDSCQDVLSMLGSPHKVFYKSEDKMKIHSPSPHKQVPSKCNDYFFNYFTLGVDILFD 298

Query: 295 GQTHKIKKFIMHTNYPGHADFNSYIKCNFII---LGSDFAGTSAEVHSYKNKITPNTKWE 351
             THK+KKF++HTNYPGH +FN Y +C F I   +  + AG   E+       T  +KW+
Sbjct: 299 ANTHKVKKFVLHTNYPGHYNFNIYHRCEFKIPLAIKKENAGGQTEI------CTTYSKWD 352

Query: 352 QVK 354
            ++
Sbjct: 353 SIQ 355


>sp|Q9BSU1|CP070_HUMAN UPF0183 protein C16orf70 OS=Homo sapiens GN=C16orf70 PE=1 SV=1
          Length = 422

 Score =  174 bits (442), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 112/360 (31%), Positives = 170/360 (47%), Gaps = 60/360 (16%)

Query: 32  FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRL 91
           F+LGMP+ +A A +++   I   V V Y ++ PL  D+I++    G  L FD ++QRL++
Sbjct: 19  FTLGMPLAQAVAILQKHCRIIKNVQVLYSEQSPLSHDLILNLTQDGIKLMFDAFNQRLKV 78

Query: 92  IEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAF 151
           IE+ D+ +++++Y        +   T   +   FG T PGVY+    ++ L + GLSF+F
Sbjct: 79  IEVCDLTKVKLKYCGVHFNSQAIAPTIEQIDQSFGATHPGVYNSAEQLFHLNFRGLSFSF 138

Query: 152 PIPAQYADCCQDREAELP------------LEFPDGTTPVTCRVSIYDGSADKKVGVGSL 199
            +         D   E P            L+ P G T    R+ IY G++         
Sbjct: 139 QL---------DSWTEAPKYEPNFAHGLASLQIPHGAT--VKRMYIYSGNS--------- 178

Query: 200 FDKAIAPSLPV----GSLYIEEVHA---------------------KLGEELHFTVGSQH 234
                AP +P+    G++Y E V                        L  +    V  + 
Sbjct: 179 LQDTKAPMMPLSCFLGNVYAESVDVLRDGTGPAGLRLRLLAAGCGPGLLADAKMRVFERS 238

Query: 235 IPFGASPQDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFD 294
           + FG S QDV + LG P  +  K  D+M IHS S  +   + C DYF+NY+T G+DILFD
Sbjct: 239 VYFGDSCQDVLSMLGSPHKVFYKSEDKMKIHSPSPHKQVPSKCNDYFFNYFTLGVDILFD 298

Query: 295 GQTHKIKKFIMHTNYPGHADFNSYIKCNFIILGSDFAGTSAEVHSYKNKITPNTKWEQVK 354
             THK+KKF++HTNYPGH +FN Y +C F I     A             T  +KW+ ++
Sbjct: 299 ANTHKVKKFVLHTNYPGHYNFNIYHRCEFKI---PLAIKKENADGQTETCTTYSKWDNIQ 355


>sp|Q922R1|CP070_MOUSE UPF0183 protein C16orf70 homolog OS=Mus musculus PE=2 SV=2
          Length = 422

 Score =  173 bits (439), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 115/354 (32%), Positives = 176/354 (49%), Gaps = 48/354 (13%)

Query: 32  FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRL 91
           F+LGMP+ +A A +++   I   V V Y ++ PL  D+I++    G  L FD ++QRL++
Sbjct: 19  FTLGMPLAQAVAILQKHCRIIRNVQVLYSEQSPLSHDLILNLTQDGITLLFDAFNQRLKV 78

Query: 92  IEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAF 151
           IE+ ++ +++++Y        +   T   +   FG T PGVY+    ++ L + GLSF+F
Sbjct: 79  IEVCELTKVKLKYCGVHFNSQAIAPTIEQIDQSFGATHPGVYNSTEQLFHLNFRGLSFSF 138

Query: 152 PIPAQYADCCQDREAELP------------LEFPDGTTPVTCRVSIYDGSA--DKKVGVG 197
            +         D   E P            L+ P G T    R+ IY G++  D K  V 
Sbjct: 139 QL---------DSWTEAPKYEPNFAHGLASLQIPHGAT--VKRMYIYSGNSLQDTKAPVM 187

Query: 198 SL---FDKAIAPSLPV-------GSLYIEEVHAKLG----EELHFTVGSQHIPFGASPQD 243
            L        A S+ V         L +  + A  G     +    V  + + FG S QD
Sbjct: 188 PLSCFLGNVYAESVDVLRDGTGPSGLRLRLLAAGCGPGVLADAKMRVFERAVYFGDSCQD 247

Query: 244 VWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFDGQTHKIKKF 303
           V + LG P  +  K  D+M IHS S  +   + C DYF+NY+T G+DILFD  THK+KKF
Sbjct: 248 VLSMLGSPHKVFYKSEDKMKIHSPSPHKQVPSKCNDYFFNYFTLGVDILFDANTHKVKKF 307

Query: 304 IMHTNYPGHADFNSYIKCNFII---LGSDFAGTSAEVHSYKNKITPNTKWEQVK 354
           ++HTNYPGH +FN Y +C F I   +  + AG   E+       T  +KW+ ++
Sbjct: 308 VLHTNYPGHYNFNIYHRCEFKIPLAIKKENAGGQTEI------CTTYSKWDSIQ 355


>sp|P34692|U183_CAEEL UPF0183 protein T01G9.2 OS=Caenorhabditis elegans GN=T01G9.2 PE=1
           SV=3
          Length = 422

 Score =  154 bits (388), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 105/330 (31%), Positives = 162/330 (49%), Gaps = 29/330 (8%)

Query: 25  PGVGIGP----FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHL 80
           P VG+      F LGMPI +  A I+Q P +   V +KY  ++P   DIII     G  L
Sbjct: 26  PDVGLKSSQFEFVLGMPINQCIAMIQQHPRMLTKVELKYSKKDPFYQDIIIYIGSTGIRL 85

Query: 81  RFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVY 140
            FD  SQ ++LIE+ ++  + + Y  ++    + +AT   V   FG T PG YD + ++Y
Sbjct: 86  YFDGLSQLIKLIEVDNLSMITLTYNDTIFSDPNNMATLDRVNEFFGSTHPGSYDDKHNIY 145

Query: 141 MLFYPGLSFAFPIPAQYADCCQDREA----ELPLEFPDGTTPVTCRVSIYDG-SADKKVG 195
           +  +PGLSF FP   + ++  + R         L++   + P   ++SIY G +  +   
Sbjct: 146 VQSWPGLSFCFPYGGENSN-LEVRPGFGGNLRSLKYDANSQPKLTKMSIYRGPNPSEPES 204

Query: 196 VGSLFDKAIAPSLPVGSLYIEEVHAKLGEELHF--------------TVGSQHIPFGASP 241
           V + F      +       I E    +G ++ F              +  ++ I FG S 
Sbjct: 205 VDTPFSCYCGQNRTRKVEAIWENGNIVGIDIQFDTQNGRIVDGEYDVSTYTRQIYFGDSV 264

Query: 242 QDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCG--DYFYNYYTRGLDILFDGQTHK 299
            DV + LG P  +  K  D+M IH       + TL G  ++F+NY+  GLDILFD  + +
Sbjct: 265 SDVQSILGAPTKVFYKSDDKMKIHRGLH---KETLYGPPNFFFNYFVMGLDILFDFVSKR 321

Query: 300 IKKFIMHTNYPGHADFNSYIKCNFIILGSD 329
           + KF++HTN PGH DF  Y +CNF I  +D
Sbjct: 322 VVKFVLHTNAPGHCDFGMYSRCNFSIFLND 351


>sp|P15238|RPC_BP163 Repressor protein C OS=Rhizobium phage 16-3 GN=C PE=1 SV=4
          Length = 263

 Score = 34.3 bits (77), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 13/35 (37%), Positives = 21/35 (60%)

Query: 244 VWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCG 278
           VW EL R  GI ++++ QM+  +  DP   ++L G
Sbjct: 53  VWRELARELGIDEQEMRQMMTEAGRDPEKVTSLAG 87


>sp|Q9D2X5|SCC4_MOUSE MAU2 chromatid cohesion factor homolog OS=Mus musculus GN=Mau2 PE=2
           SV=3
          Length = 619

 Score = 33.5 bits (75), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 284 YYTRGLDILFDGQTHKIKKFIMHTNYPGHA-DFNSYIKCNFIILGSDF 330
           +Y RGL   F G+ ++ K+F+  T    +A D N    C+ ++LG  F
Sbjct: 468 FYVRGLFSFFQGRYNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHIF 515


>sp|Q9Y6X3|SCC4_HUMAN MAU2 chromatid cohesion factor homolog OS=Homo sapiens GN=MAU2 PE=1
           SV=2
          Length = 613

 Score = 33.5 bits (75), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 284 YYTRGLDILFDGQTHKIKKFIMHTNYPGHA-DFNSYIKCNFIILGSDF 330
           +Y RGL   F G+ ++ K+F+  T    +A D N    C+ ++LG  F
Sbjct: 462 FYVRGLFSFFQGRYNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHIF 509


>sp|B4ZIX8|SCC4_XENLA MAU2 chromatid cohesion factor homolog OS=Xenopus laevis GN=mau2
           PE=1 SV=1
          Length = 607

 Score = 33.5 bits (75), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 284 YYTRGLDILFDGQTHKIKKFIMHTNYPGHA-DFNSYIKCNFIILGSDF 330
           +Y RGL   F G+ ++ K+F+  T    +A D N    C+ ++LG  F
Sbjct: 456 FYIRGLFSFFQGRYNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHIF 503


>sp|B1H1Z8|SCC4_XENTR MAU2 chromatid cohesion factor homolog OS=Xenopus tropicalis
           GN=mau2 PE=2 SV=1
          Length = 604

 Score = 33.5 bits (75), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 16/48 (33%), Positives = 26/48 (54%), Gaps = 1/48 (2%)

Query: 284 YYTRGLDILFDGQTHKIKKFIMHTNYPGHA-DFNSYIKCNFIILGSDF 330
           +Y RGL   F G+ ++ K+F+  T    +A D N    C+ ++LG  F
Sbjct: 453 FYIRGLFSFFQGRYNEAKRFLRETLKMSNAEDLNRLTACSLVLLGHIF 500


>sp|P46530|NOTC1_DANRE Neurogenic locus notch homolog protein 1 OS=Danio rerio GN=notch1a
           PE=2 SV=1
          Length = 2437

 Score = 32.0 bits (71), Expect = 7.0,   Method: Composition-based stats.
 Identities = 18/49 (36%), Positives = 21/49 (42%), Gaps = 1/49 (2%)

Query: 154 PAQYADCCQDREAELPLEFPDGTTPVTCRVSIYDGSADKKVGVGSLFDK 202
           P +    CQDRE       P GTT V C ++I D    K    G   DK
Sbjct: 609 PCRNGGTCQDRENAYICTCPKGTTGVNCEINI-DDCKRKPCDYGKCIDK 656


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.323    0.141    0.442 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 150,708,219
Number of Sequences: 539616
Number of extensions: 6882735
Number of successful extensions: 12648
Number of sequences better than 100.0: 12
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 5
Number of HSP's that attempted gapping in prelim test: 12629
Number of HSP's gapped (non-prelim): 16
length of query: 365
length of database: 191,569,459
effective HSP length: 119
effective length of query: 246
effective length of database: 127,355,155
effective search space: 31329368130
effective search space used: 31329368130
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 62 (28.5 bits)