BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 008286
(571 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q6YRM6|Y8219_ORYSJ UPF0392 protein Os08g0121900 OS=Oryza sativa subsp. japonica
GN=Os08g0121900 PE=2 SV=1
Length = 584
Score = 295 bits (755), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 179/488 (36%), Positives = 268/488 (54%), Gaps = 51/488 (10%)
Query: 98 SVLLPDWEVLVLLSPETSLDSVEGFYCLFWNSQTSPARFS----GVLPFTERTAFKCAMP 153
+V LPD VL++L + L + F CL+ +S R LP + C P
Sbjct: 109 AVALPD-HVLLMLRDGSLLPASGQFECLYSPVNSSQLRRQPLSVATLP-DGPSLVHC--P 164
Query: 154 NGARRPPLWQPILTKYPVKENPAKERELLRMKKLAYESI--STEDDVVLFVKGVNARSRS 211
G R + + PV L+ +L Y ++ S ++ V+F KG+N R
Sbjct: 165 AGPSRVAVSLSLAQSVPVAP--------LQWDRLVYTALIDSKDNSTVVFAKGMNLRP-G 215
Query: 212 RSPQPQDFMCVFGD-------AVKTPVTSCTQEVFRCSHPE-----LTAFTSGT------ 253
R P + CVFG V +PV S QE+FRC P L T G
Sbjct: 216 RLGVPSRYECVFGRDFSKPKLVVTSPVVSAAQEIFRCVTPVRIRRYLRMTTGGKNSVNND 275
Query: 254 DQPIKMSLQIMHQVQNRTLPSVAYYRPR---QSHAQEPKSEICVCTMVYNVAKFVKEWVI 310
D+P+ +S++ + + TLPS+A P H + +CVCTM+ N A+F++EW+I
Sbjct: 276 DKPMLVSIRTKGR-GSSTLPSIAQPEPLPRYNKHWRRKAHSMCVCTMLRNQARFLREWII 334
Query: 311 YHTKIGVEKFILYDNGSEDDLQNVVNELNGDGYNVTTLLWIWPKTQEAGFSHNAIHAKHS 370
YH++IGV+++ +YDN S+D ++ V+N ++ YNVT LW W K+QEAGF+H A+ A+ S
Sbjct: 335 YHSRIGVQRWFIYDNNSDDGIEEVLNTMDSSRYNVTRYLWPWMKSQEAGFAHCALRARES 394
Query: 371 CKWMLYIDVDEFVYSPSWHDSGPSKHLLK--ALIPQLHSIGQVSIRCLDFGPSGQKSHPI 428
C+W+ +ID+DEF++ P + + +L+ ++ P+ IG++ C FGPSG+ P
Sbjct: 395 CEWVGFIDIDEFLHFPG---NQTLQDVLRNYSVKPR---IGELRTACHSFGPSGRTKIPK 448
Query: 429 EGVTQGYNCQRWDVSQQRHKSMVLLEAIDDSLDNVVHHFRLKNTFQWRELSMSSALVNHY 488
+GVT GY C+ + +RHKS+V +A++ SL NVVHHF LK ++ + L+NHY
Sbjct: 449 KGVTTGYTCRL--AAPERHKSIVRPDALNPSLINVVHHFHLKEGMKYVNIGQGMMLINHY 506
Query: 489 KYQAWSEFKTKFRRRVSAYVADWRDATNPNSKDRTPGLGFEPIKPQGWEFEFCNFTDDRL 548
KYQ W FK KF RV+ YVADW+D N S+DR PGLG +P++P+ W FC D+ L
Sbjct: 507 KYQVWEVFKDKFSGRVATYVADWQDEENVGSRDRAPGLGTKPVEPEDWPRRFCEVYDNGL 566
Query: 549 KLLTQRWF 556
K Q+ F
Sbjct: 567 KDFVQKVF 574
>sp|Q94K98|Y1720_ARATH UPF0392 protein At1g27200 OS=Arabidopsis thaliana GN=At1g27200 PE=2
SV=2
Length = 575
Score = 288 bits (738), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 173/425 (40%), Positives = 240/425 (56%), Gaps = 42/425 (9%)
Query: 172 KENPAKERELLRMKKLAYESISTEDDVVLFVKGVNARSRSRSPQPQDFMCVFG-----DA 226
K+ + R + +K+ YE++ D VV+FVKG+ R S P + C F +
Sbjct: 164 KKMKKQSRRVHNWEKVGYEAVIDGDTVVVFVKGLTRRPHKES-DPSYYKCQFEIENSEEK 222
Query: 227 VKTPVTSCTQEVFRCSHPELTAFTSGTDQPIKMSLQIMHQV-------QNRT---LPSVA 276
T + QEV RC PE +K++ ++M +V + RT LPSVA
Sbjct: 223 EVTQAIAAAQEVVRCGLPE----------SLKLNPEMMFRVSVIHIDPRGRTTPALPSVA 272
Query: 277 -YYRPRQSHAQEPKS----EICVCTMVYNVAKFVKEWVIYHTKIGVEKFILYDNGSEDDL 331
Y +E KS E+CVCTM++N A F++EW++YH+ +GVE++ +YDN S+D +
Sbjct: 273 RIYGSDSIEKKEKKSGVKHELCVCTMLWNQAPFLREWIMYHSWLGVERWFIYDNNSDDGI 332
Query: 332 QNVVNELNGDGYNVTTLLWIWPKTQEAGFSHNAIHAKHSCKWMLYIDVDEFVYSPSWHDS 391
Q + L+ + YNV+ +W W KTQEAGFSH A+ AK C W+ + DVDEF Y P+
Sbjct: 333 QEEIELLSSENYNVSRHVWPWIKTQEAGFSHCAVRAKEECNWVGFFDVDEFYYFPTHRSQ 392
Query: 392 G-PSKHLLKALIPQLHS---IGQVSIRCLDFGPSGQKSHPIEGVTQGYNCQRWDVSQQRH 447
G PSK+ LK+L+ S +G++ C +GPSG S P +GVT GY C++ + +RH
Sbjct: 393 GLPSKNALKSLVSNYTSWDLVGEIRTDCHSYGPSGLTSVPSQGVTVGYTCRQ--ANPERH 450
Query: 448 KSMVLLEAIDDSLDNVVHHFRLKNTFQWRELSMSSALVNHYKYQAWSEFKTKFRRRVSAY 507
KS++ E + SL N VHHF+LK L S A+VNHYKYQ W FK KF RRV+ Y
Sbjct: 451 KSIIRPELLTSSLLNEVHHFQLKEGVGHMSLVESVAVVNHYKYQVWDTFKAKFYRRVATY 510
Query: 508 VADWRDATNPNSKDRTPGLGFEPIKPQGWEFEFCNFTDDRLKLLTQRWFG-QLTPNGGYK 566
V DW++ N SKDR PGLG E I+P W+ FC D LK L F Q+T GY
Sbjct: 511 VVDWQENQNQGSKDRAPGLGTEAIEPPDWKRRFCEVWDTGLKDLVMSNFADQVT---GY- 566
Query: 567 MAWQR 571
+ WQR
Sbjct: 567 LPWQR 571
>sp|B9S2H4|Y232_RICCO UPF0392 protein RCOM_0530710 OS=Ricinus communis GN=RCOM_0699480
PE=3 SV=1
Length = 578
Score = 274 bits (701), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 172/495 (34%), Positives = 256/495 (51%), Gaps = 29/495 (5%)
Query: 99 VLLPDWEVLVLLSPETSLDSVEGFYCLFWNSQT---SPARFSGVLPFTERTAFKCAMPNG 155
VLLPD +L++ + + +++ Y +NS PA S +++ +C +P
Sbjct: 88 VLLPDHVLLIVSNKVATSQNLDCVYSNLYNSHDVVLKPA-LSVNQYHRDKSIVRCQLPPN 146
Query: 156 ARRPPLWQPILTKYPVKENPAKERELLRMKKLAYESISTEDDVVLFVKGVNARSRSRSPQ 215
++ + A ++ ++ YE++ + V +FVKG+N R S
Sbjct: 147 NYSAAVYLRWSWEAAEGVAAAAPASVVSWDRVVYEAMLDWNTVAVFVKGLNLRPHKESDS 206
Query: 216 PQDFMCVFGDA---------VKTPVTSCTQEVFRCSHPELTAFTSGTDQPIKMSLQIMHQ 266
+ F C FG + T + QEV RC P Q I++++ ++
Sbjct: 207 SK-FRCHFGLSKFDKDEGIVFTTEAITAAQEVIRCLLPRSIRNNPVKAQGIRVTVSRINA 265
Query: 267 VQNRT---LPSVAYYRPRQSH---AQEPKSEICVCTMVYNVAKFVKEWVIYHTKIGVEKF 320
++ LPSVA +S+ + K E+C CTM++N A F+ EW+ YH +GV+++
Sbjct: 266 GEDGVDAPLPSVAKVYGAKSYEKRSNRGKYELCACTMLWNQASFLHEWITYHAWLGVQRW 325
Query: 321 ILYDNGSEDDLQNVVNELNGDGYNVTTLLWIWPKTQEAGFSHNAIHAKHSCKWMLYIDVD 380
+YDN S+D +Q VV+ELN YNVT W W K QEAGFSH A+ A+ CKW+ + DVD
Sbjct: 326 FIYDNNSDDGIQEVVDELNLQNYNVTRHSWPWIKAQEAGFSHCALRARSECKWLGFFDVD 385
Query: 381 EFVYSPSWHDSGP-SKHLLKALIPQL---HSIGQVSIRCLDFGPSGQKSHPIEGVTQGYN 436
EF Y P ++ L+ L+ + ++ C FGPSG S P +GVT GY
Sbjct: 386 EFFYLPRHRGQDMLGENSLRTLVANYSDSSTYAEIRTICHSFGPSGLTSAPSQGVTVGYT 445
Query: 437 CQRWDVSQQRHKSMVLLEAIDDSLDNVVHHFRLKNTFQWRELSMSSALVNHYKYQAWSEF 496
C+ + +RHKS+V E +D +L NVVHHF+LK +++ + S+A+VNHYKYQ W F
Sbjct: 446 CRL--QAPERHKSIVRPELLDTTLLNVVHHFKLKEGYRYLNVPESTAVVNHYKYQVWDTF 503
Query: 497 KTKFRRRVSAYVADWRDATNPNSKDRTPGLGFEPIKPQGWEFEFCNFTDDRLKLLTQRWF 556
K KF RRVS YVA+W++ N SKDR PGLG I+P W FC D LK F
Sbjct: 504 KAKFFRRVSTYVANWQEDQNQGSKDRAPGLGTVAIEPPDWRLRFCEVWDTGLKDFVLANF 563
Query: 557 GQLTPNGGYKMAWQR 571
GY + W+R
Sbjct: 564 ADTA--SGY-LPWER 575
>sp|B9SLR1|Y231_RICCO UPF0392 protein RCOM_0530710 OS=Ricinus communis GN=RCOM_0530710
PE=3 SV=1
Length = 552
Score = 266 bits (681), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/386 (38%), Positives = 221/386 (57%), Gaps = 32/386 (8%)
Query: 183 RMKKLAYESISTEDDV-VLFVKGVNARSRSRSPQPQDFMCVFG-------DAVKTPVTSC 234
R L YE++ D+ V+FVKG N R+ R F CV+G +++ V S
Sbjct: 161 RWDSLVYEAMIDRDNTTVVFVKGFNLRA-DRIYNASKFECVYGWDFRKTKFVLRSNVISI 219
Query: 235 TQEVFRCSHPELTAFTS--GTDQPIKMSLQIMHQVQNRTLPSVA------YYRPRQSHAQ 286
QE+ RC P L+ + + IK+S+++ + TL S+A P
Sbjct: 220 AQEIVRCQTP-LSILNNQLKVNNAIKVSIRLKGK---GTLHSIARPGVQLLTDPEPGLRG 275
Query: 287 EPKSEICVCTMVYNVAKFVKEWVIYHTKIGVEKFILYDNGSEDDLQNVVNELNGDGYNVT 346
E E+C+CTM+ N +F+KEWV+YH++IGVE++ +YDN SEDD+ +V+ L +N++
Sbjct: 276 EKPHEMCICTMLRNQGRFLKEWVMYHSQIGVERWFIYDNNSEDDIDSVIESLIDAKFNIS 335
Query: 347 TLLWIWPKTQEAGFSHNAIHAKHSCKWMLYIDVDEFVYSPSWHDSGPSKHLLKALIPQLH 406
+W W K QEAGF+H A+ A+ C+W+ +IDVDEF + P+ +L A+ Q +
Sbjct: 336 RHVWPWVKAQEAGFAHCALRARGLCEWVGFIDVDEFFHLPT------GLNLQDAVKNQSN 389
Query: 407 S---IGQVSIRCLDFGPSGQKSHPIEGVTQGYNCQRWDVSQQRHKSMVLLEAIDDSLDNV 463
S + ++ + C FGPSG K P +GVT GY C+ + +RHKS+V EA++ +L NV
Sbjct: 390 SGNNVAELRVSCHSFGPSGLKHVPAQGVTVGYTCRM--MLPERHKSIVKPEALNSTLINV 447
Query: 464 VHHFRLKNTFQWRELSMSSALVNHYKYQAWSEFKTKFRRRVSAYVADWRDATNPNSKDRT 523
VHHF L++ F++ ++NHYKYQ W FK KF RRV+ YV DW++ N SKDR
Sbjct: 448 VHHFHLRDGFRYVNADKGILVINHYKYQVWEVFKEKFYRRVATYVVDWQNEQNVGSKDRA 507
Query: 524 PGLGFEPIKPQGWEFEFCNFTDDRLK 549
PGLG ++P W FC +D L+
Sbjct: 508 PGLGTRAVEPPDWSSRFCEVSDTGLR 533
>sp|Q21006|NHR34_CAEEL Nuclear hormone receptor family member nhr-34 OS=Caenorhabditis
elegans GN=nhr-34 PE=2 SV=3
Length = 631
Score = 34.3 bits (77), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 30/102 (29%), Positives = 47/102 (46%), Gaps = 11/102 (10%)
Query: 54 RFLPAKRPHARNLISNVMKDDFQEVTRLSRHVSSVQDSAKYQSVSVLLPDWEVLVLLSPE 113
R R H + + ++ +Q V RH SSVQ S + + +LLP E L +
Sbjct: 498 RLSEEGRAHIKRVRDKYIEALYQHVQHQHRHFSSVQTSMRISKILLLLPSIEHL----SQ 553
Query: 114 TSLDSVEGFYCLFWNSQTSPARFSGVLPFTERTAFKCAMPNG 155
D+V+ F LF + A +G LP+ ++ K +PNG
Sbjct: 554 QEDDNVQ-FLALF-----NLANLNG-LPYELHSSIKQHIPNG 588
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.134 0.422
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 221,237,076
Number of Sequences: 539616
Number of extensions: 9238333
Number of successful extensions: 18658
Number of sequences better than 100.0: 8
Number of HSP's better than 100.0 without gapping: 4
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 18642
Number of HSP's gapped (non-prelim): 8
length of query: 571
length of database: 191,569,459
effective HSP length: 123
effective length of query: 448
effective length of database: 125,196,691
effective search space: 56088117568
effective search space used: 56088117568
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 64 (29.3 bits)