BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 008286
         (571 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q6YRM6|Y8219_ORYSJ UPF0392 protein Os08g0121900 OS=Oryza sativa subsp. japonica
           GN=Os08g0121900 PE=2 SV=1
          Length = 584

 Score =  295 bits (755), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 179/488 (36%), Positives = 268/488 (54%), Gaps = 51/488 (10%)

Query: 98  SVLLPDWEVLVLLSPETSLDSVEGFYCLFWNSQTSPARFS----GVLPFTERTAFKCAMP 153
           +V LPD  VL++L   + L +   F CL+    +S  R        LP    +   C  P
Sbjct: 109 AVALPD-HVLLMLRDGSLLPASGQFECLYSPVNSSQLRRQPLSVATLP-DGPSLVHC--P 164

Query: 154 NGARRPPLWQPILTKYPVKENPAKERELLRMKKLAYESI--STEDDVVLFVKGVNARSRS 211
            G  R  +   +    PV          L+  +L Y ++  S ++  V+F KG+N R   
Sbjct: 165 AGPSRVAVSLSLAQSVPVAP--------LQWDRLVYTALIDSKDNSTVVFAKGMNLRP-G 215

Query: 212 RSPQPQDFMCVFGD-------AVKTPVTSCTQEVFRCSHPE-----LTAFTSGT------ 253
           R   P  + CVFG         V +PV S  QE+FRC  P      L   T G       
Sbjct: 216 RLGVPSRYECVFGRDFSKPKLVVTSPVVSAAQEIFRCVTPVRIRRYLRMTTGGKNSVNND 275

Query: 254 DQPIKMSLQIMHQVQNRTLPSVAYYRPR---QSHAQEPKSEICVCTMVYNVAKFVKEWVI 310
           D+P+ +S++   +  + TLPS+A   P      H +     +CVCTM+ N A+F++EW+I
Sbjct: 276 DKPMLVSIRTKGR-GSSTLPSIAQPEPLPRYNKHWRRKAHSMCVCTMLRNQARFLREWII 334

Query: 311 YHTKIGVEKFILYDNGSEDDLQNVVNELNGDGYNVTTLLWIWPKTQEAGFSHNAIHAKHS 370
           YH++IGV+++ +YDN S+D ++ V+N ++   YNVT  LW W K+QEAGF+H A+ A+ S
Sbjct: 335 YHSRIGVQRWFIYDNNSDDGIEEVLNTMDSSRYNVTRYLWPWMKSQEAGFAHCALRARES 394

Query: 371 CKWMLYIDVDEFVYSPSWHDSGPSKHLLK--ALIPQLHSIGQVSIRCLDFGPSGQKSHPI 428
           C+W+ +ID+DEF++ P    +   + +L+  ++ P+   IG++   C  FGPSG+   P 
Sbjct: 395 CEWVGFIDIDEFLHFPG---NQTLQDVLRNYSVKPR---IGELRTACHSFGPSGRTKIPK 448

Query: 429 EGVTQGYNCQRWDVSQQRHKSMVLLEAIDDSLDNVVHHFRLKNTFQWRELSMSSALVNHY 488
           +GVT GY C+    + +RHKS+V  +A++ SL NVVHHF LK   ++  +     L+NHY
Sbjct: 449 KGVTTGYTCRL--AAPERHKSIVRPDALNPSLINVVHHFHLKEGMKYVNIGQGMMLINHY 506

Query: 489 KYQAWSEFKTKFRRRVSAYVADWRDATNPNSKDRTPGLGFEPIKPQGWEFEFCNFTDDRL 548
           KYQ W  FK KF  RV+ YVADW+D  N  S+DR PGLG +P++P+ W   FC   D+ L
Sbjct: 507 KYQVWEVFKDKFSGRVATYVADWQDEENVGSRDRAPGLGTKPVEPEDWPRRFCEVYDNGL 566

Query: 549 KLLTQRWF 556
           K   Q+ F
Sbjct: 567 KDFVQKVF 574


>sp|Q94K98|Y1720_ARATH UPF0392 protein At1g27200 OS=Arabidopsis thaliana GN=At1g27200 PE=2
           SV=2
          Length = 575

 Score =  288 bits (738), Expect = 7e-77,   Method: Compositional matrix adjust.
 Identities = 173/425 (40%), Positives = 240/425 (56%), Gaps = 42/425 (9%)

Query: 172 KENPAKERELLRMKKLAYESISTEDDVVLFVKGVNARSRSRSPQPQDFMCVFG-----DA 226
           K+   + R +   +K+ YE++   D VV+FVKG+  R    S  P  + C F      + 
Sbjct: 164 KKMKKQSRRVHNWEKVGYEAVIDGDTVVVFVKGLTRRPHKES-DPSYYKCQFEIENSEEK 222

Query: 227 VKTPVTSCTQEVFRCSHPELTAFTSGTDQPIKMSLQIMHQV-------QNRT---LPSVA 276
             T   +  QEV RC  PE           +K++ ++M +V       + RT   LPSVA
Sbjct: 223 EVTQAIAAAQEVVRCGLPE----------SLKLNPEMMFRVSVIHIDPRGRTTPALPSVA 272

Query: 277 -YYRPRQSHAQEPKS----EICVCTMVYNVAKFVKEWVIYHTKIGVEKFILYDNGSEDDL 331
             Y       +E KS    E+CVCTM++N A F++EW++YH+ +GVE++ +YDN S+D +
Sbjct: 273 RIYGSDSIEKKEKKSGVKHELCVCTMLWNQAPFLREWIMYHSWLGVERWFIYDNNSDDGI 332

Query: 332 QNVVNELNGDGYNVTTLLWIWPKTQEAGFSHNAIHAKHSCKWMLYIDVDEFVYSPSWHDS 391
           Q  +  L+ + YNV+  +W W KTQEAGFSH A+ AK  C W+ + DVDEF Y P+    
Sbjct: 333 QEEIELLSSENYNVSRHVWPWIKTQEAGFSHCAVRAKEECNWVGFFDVDEFYYFPTHRSQ 392

Query: 392 G-PSKHLLKALIPQLHS---IGQVSIRCLDFGPSGQKSHPIEGVTQGYNCQRWDVSQQRH 447
           G PSK+ LK+L+    S   +G++   C  +GPSG  S P +GVT GY C++   + +RH
Sbjct: 393 GLPSKNALKSLVSNYTSWDLVGEIRTDCHSYGPSGLTSVPSQGVTVGYTCRQ--ANPERH 450

Query: 448 KSMVLLEAIDDSLDNVVHHFRLKNTFQWRELSMSSALVNHYKYQAWSEFKTKFRRRVSAY 507
           KS++  E +  SL N VHHF+LK       L  S A+VNHYKYQ W  FK KF RRV+ Y
Sbjct: 451 KSIIRPELLTSSLLNEVHHFQLKEGVGHMSLVESVAVVNHYKYQVWDTFKAKFYRRVATY 510

Query: 508 VADWRDATNPNSKDRTPGLGFEPIKPQGWEFEFCNFTDDRLKLLTQRWFG-QLTPNGGYK 566
           V DW++  N  SKDR PGLG E I+P  W+  FC   D  LK L    F  Q+T   GY 
Sbjct: 511 VVDWQENQNQGSKDRAPGLGTEAIEPPDWKRRFCEVWDTGLKDLVMSNFADQVT---GY- 566

Query: 567 MAWQR 571
           + WQR
Sbjct: 567 LPWQR 571


>sp|B9S2H4|Y232_RICCO UPF0392 protein RCOM_0530710 OS=Ricinus communis GN=RCOM_0699480
           PE=3 SV=1
          Length = 578

 Score =  274 bits (701), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 172/495 (34%), Positives = 256/495 (51%), Gaps = 29/495 (5%)

Query: 99  VLLPDWEVLVLLSPETSLDSVEGFYCLFWNSQT---SPARFSGVLPFTERTAFKCAMPNG 155
           VLLPD  +L++ +   +  +++  Y   +NS      PA  S      +++  +C +P  
Sbjct: 88  VLLPDHVLLIVSNKVATSQNLDCVYSNLYNSHDVVLKPA-LSVNQYHRDKSIVRCQLPPN 146

Query: 156 ARRPPLWQPILTKYPVKENPAKERELLRMKKLAYESISTEDDVVLFVKGVNARSRSRSPQ 215
                ++     +       A    ++   ++ YE++   + V +FVKG+N R    S  
Sbjct: 147 NYSAAVYLRWSWEAAEGVAAAAPASVVSWDRVVYEAMLDWNTVAVFVKGLNLRPHKESDS 206

Query: 216 PQDFMCVFGDA---------VKTPVTSCTQEVFRCSHPELTAFTSGTDQPIKMSLQIMHQ 266
            + F C FG +           T   +  QEV RC  P          Q I++++  ++ 
Sbjct: 207 SK-FRCHFGLSKFDKDEGIVFTTEAITAAQEVIRCLLPRSIRNNPVKAQGIRVTVSRINA 265

Query: 267 VQNRT---LPSVAYYRPRQSH---AQEPKSEICVCTMVYNVAKFVKEWVIYHTKIGVEKF 320
            ++     LPSVA     +S+   +   K E+C CTM++N A F+ EW+ YH  +GV+++
Sbjct: 266 GEDGVDAPLPSVAKVYGAKSYEKRSNRGKYELCACTMLWNQASFLHEWITYHAWLGVQRW 325

Query: 321 ILYDNGSEDDLQNVVNELNGDGYNVTTLLWIWPKTQEAGFSHNAIHAKHSCKWMLYIDVD 380
            +YDN S+D +Q VV+ELN   YNVT   W W K QEAGFSH A+ A+  CKW+ + DVD
Sbjct: 326 FIYDNNSDDGIQEVVDELNLQNYNVTRHSWPWIKAQEAGFSHCALRARSECKWLGFFDVD 385

Query: 381 EFVYSPSWHDSGP-SKHLLKALIPQL---HSIGQVSIRCLDFGPSGQKSHPIEGVTQGYN 436
           EF Y P         ++ L+ L+       +  ++   C  FGPSG  S P +GVT GY 
Sbjct: 386 EFFYLPRHRGQDMLGENSLRTLVANYSDSSTYAEIRTICHSFGPSGLTSAPSQGVTVGYT 445

Query: 437 CQRWDVSQQRHKSMVLLEAIDDSLDNVVHHFRLKNTFQWRELSMSSALVNHYKYQAWSEF 496
           C+    + +RHKS+V  E +D +L NVVHHF+LK  +++  +  S+A+VNHYKYQ W  F
Sbjct: 446 CRL--QAPERHKSIVRPELLDTTLLNVVHHFKLKEGYRYLNVPESTAVVNHYKYQVWDTF 503

Query: 497 KTKFRRRVSAYVADWRDATNPNSKDRTPGLGFEPIKPQGWEFEFCNFTDDRLKLLTQRWF 556
           K KF RRVS YVA+W++  N  SKDR PGLG   I+P  W   FC   D  LK      F
Sbjct: 504 KAKFFRRVSTYVANWQEDQNQGSKDRAPGLGTVAIEPPDWRLRFCEVWDTGLKDFVLANF 563

Query: 557 GQLTPNGGYKMAWQR 571
                  GY + W+R
Sbjct: 564 ADTA--SGY-LPWER 575


>sp|B9SLR1|Y231_RICCO UPF0392 protein RCOM_0530710 OS=Ricinus communis GN=RCOM_0530710
           PE=3 SV=1
          Length = 552

 Score =  266 bits (681), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 148/386 (38%), Positives = 221/386 (57%), Gaps = 32/386 (8%)

Query: 183 RMKKLAYESISTEDDV-VLFVKGVNARSRSRSPQPQDFMCVFG-------DAVKTPVTSC 234
           R   L YE++   D+  V+FVKG N R+  R      F CV+G         +++ V S 
Sbjct: 161 RWDSLVYEAMIDRDNTTVVFVKGFNLRA-DRIYNASKFECVYGWDFRKTKFVLRSNVISI 219

Query: 235 TQEVFRCSHPELTAFTS--GTDQPIKMSLQIMHQVQNRTLPSVA------YYRPRQSHAQ 286
            QE+ RC  P L+   +    +  IK+S+++  +    TL S+A         P      
Sbjct: 220 AQEIVRCQTP-LSILNNQLKVNNAIKVSIRLKGK---GTLHSIARPGVQLLTDPEPGLRG 275

Query: 287 EPKSEICVCTMVYNVAKFVKEWVIYHTKIGVEKFILYDNGSEDDLQNVVNELNGDGYNVT 346
           E   E+C+CTM+ N  +F+KEWV+YH++IGVE++ +YDN SEDD+ +V+  L    +N++
Sbjct: 276 EKPHEMCICTMLRNQGRFLKEWVMYHSQIGVERWFIYDNNSEDDIDSVIESLIDAKFNIS 335

Query: 347 TLLWIWPKTQEAGFSHNAIHAKHSCKWMLYIDVDEFVYSPSWHDSGPSKHLLKALIPQLH 406
             +W W K QEAGF+H A+ A+  C+W+ +IDVDEF + P+        +L  A+  Q +
Sbjct: 336 RHVWPWVKAQEAGFAHCALRARGLCEWVGFIDVDEFFHLPT------GLNLQDAVKNQSN 389

Query: 407 S---IGQVSIRCLDFGPSGQKSHPIEGVTQGYNCQRWDVSQQRHKSMVLLEAIDDSLDNV 463
           S   + ++ + C  FGPSG K  P +GVT GY C+   +  +RHKS+V  EA++ +L NV
Sbjct: 390 SGNNVAELRVSCHSFGPSGLKHVPAQGVTVGYTCRM--MLPERHKSIVKPEALNSTLINV 447

Query: 464 VHHFRLKNTFQWRELSMSSALVNHYKYQAWSEFKTKFRRRVSAYVADWRDATNPNSKDRT 523
           VHHF L++ F++        ++NHYKYQ W  FK KF RRV+ YV DW++  N  SKDR 
Sbjct: 448 VHHFHLRDGFRYVNADKGILVINHYKYQVWEVFKEKFYRRVATYVVDWQNEQNVGSKDRA 507

Query: 524 PGLGFEPIKPQGWEFEFCNFTDDRLK 549
           PGLG   ++P  W   FC  +D  L+
Sbjct: 508 PGLGTRAVEPPDWSSRFCEVSDTGLR 533


>sp|Q21006|NHR34_CAEEL Nuclear hormone receptor family member nhr-34 OS=Caenorhabditis
           elegans GN=nhr-34 PE=2 SV=3
          Length = 631

 Score = 34.3 bits (77), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 11/102 (10%)

Query: 54  RFLPAKRPHARNLISNVMKDDFQEVTRLSRHVSSVQDSAKYQSVSVLLPDWEVLVLLSPE 113
           R     R H + +    ++  +Q V    RH SSVQ S +   + +LLP  E L     +
Sbjct: 498 RLSEEGRAHIKRVRDKYIEALYQHVQHQHRHFSSVQTSMRISKILLLLPSIEHL----SQ 553

Query: 114 TSLDSVEGFYCLFWNSQTSPARFSGVLPFTERTAFKCAMPNG 155
              D+V+ F  LF     + A  +G LP+   ++ K  +PNG
Sbjct: 554 QEDDNVQ-FLALF-----NLANLNG-LPYELHSSIKQHIPNG 588


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.320    0.134    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 221,237,076
Number of Sequences: 539616
Number of extensions: 9238333
Number of successful extensions: 18658
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 18642
Number of HSP's gapped (non-prelim): 8
length of query: 571
length of database: 191,569,459
effective HSP length: 123
effective length of query: 448
effective length of database: 125,196,691
effective search space: 56088117568
effective search space used: 56088117568
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (29.3 bits)