BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 015719
(402 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SD33|U183_ARATH UPF0183 protein At3g51130 OS=Arabidopsis thaliana GN=At3g51130 PE=2
SV=2
Length = 410
Score = 706 bits (1823), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 328/402 (81%), Positives = 366/402 (91%), Gaps = 3/402 (0%)
Query: 2 LQSQKPRRRCEGTAMGAIVLDLRPGVGIGPFSLGMPICEAFASIEQQPNIYDVVHVKYFD 61
L Q+PRRR EGTAMGA V DLRPGVGIGPFS+GMPICEAFA IEQQPNIYDVVHVKY+D
Sbjct: 11 LVMQRPRRRLEGTAMGATVFDLRPGVGIGPFSIGMPICEAFAQIEQQPNIYDVVHVKYYD 70
Query: 62 EEPLKLDIIISFPDHGFHLRFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAV 121
E+PLKLD++ISFPDHGFHLRFDPWSQRLRL+EIFD+KRLQMRYATS+IGG STLATFVAV
Sbjct: 71 EDPLKLDVVISFPDHGFHLRFDPWSQRLRLVEIFDVKRLQMRYATSMIGGPSTLATFVAV 130
Query: 122 YALFGPTFPGVYDKERSVYMLFYPGLSFAFPIPAQYADCCQDREAELPLEFPDGTTPVTC 181
YALFGPTFPG+YDKER +Y LFYPGLSF FPIP QY DCC D EA LPLEFPDGTTPVTC
Sbjct: 131 YALFGPTFPGIYDKERGIYSLFYPGLSFEFPIPNQYTDCCHDGEAALPLEFPDGTTPVTC 190
Query: 182 RVSIYDGSADKKVGVGSLFDKAIAPSLPVGSLYIEEVHAKLGEELHFTVGSQHIPFGASP 241
RVSIYD S+DKKVGVG L D+A P LP GSLY+EEVH K G+EL+FTVG QH+PFGASP
Sbjct: 191 RVSIYDNSSDKKVGVGKLMDRASVPPLPPGSLYMEEVHVKPGKELYFTVGGQHMPFGASP 250
Query: 242 QDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFDGQTHKIK 301
QDVWTELGRPCGIH KQVDQMVIHSASDPRP++T+CGDYFYNY+TRGLDILFDG+THK+K
Sbjct: 251 QDVWTELGRPCGIHPKQVDQMVIHSASDPRPKTTICGDYFYNYFTRGLDILFDGETHKVK 310
Query: 302 KFIMHTNYPGHADFNSYIKCNFII-LGSDCTSAEVHSYKNKITPNTKWEQVKEILGDCGR 360
KF++HTNYPGHADFNSYIKCNF+I G+D +AE + NKITP+T W+QVKEILG+CG
Sbjct: 311 KFVLHTNYPGHADFNSYIKCNFVISAGAD--AAEANRSGNKITPSTNWDQVKEILGECGP 368
Query: 361 AAIQTQGSTSNPFGSTFVYGYQNIAFEVMKNGYISTVTMFQS 402
AAIQTQGSTSNPFGST+VYGYQN+AFEVMKNG+I+T+T+FQS
Sbjct: 369 AAIQTQGSTSNPFGSTYVYGYQNVAFEVMKNGHIATITLFQS 410
>sp|Q9VSH9|U183_DROME UPF0183 protein CG7083 OS=Drosophila melanogaster GN=CG7083 PE=2
SV=1
Length = 438
Score = 238 bits (607), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 154/428 (35%), Positives = 220/428 (51%), Gaps = 62/428 (14%)
Query: 21 LDLRPGVGIG----PFSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDH 76
L++ P + +G F LGM +A A I+ Q I V V Y D PL +DIII+ P
Sbjct: 4 LEIVPEISLGCDAWEFVLGMHFSQAIAIIQSQVGIIKGVQVLYSDTTPLGVDIIINLPQD 63
Query: 77 GFHLRFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKE 136
G L FDP SQRL+ IE+F++K +++RY L + + FG T PGVYD
Sbjct: 64 GVRLIFDPVSQRLKTIEVFNMKLVKLRYFGVYFNSPEVLPSIEQIEHSFGATHPGVYDAA 123
Query: 137 RSVYMLFYPGLSFAFPIPAQ----YADCCQDREAELPLEFPDGTTPVTCRVSIYDGSADK 192
+ ++ L + GLSF FP+ ++ YA L F +G +PV ++S+Y GS
Sbjct: 124 KQLFALHFRGLSFYFPVDSKLHSGYAHGLSS------LVFLNGASPVVSKMSLYAGS--- 174
Query: 193 KVGVGSLFDKAIAPSLPVG----SLYIEEV--------HAKLGEELHFTVGS-------- 232
++ + + PSLP+ +Y+E H K + FT GS
Sbjct: 175 -----NVLENRV-PSLPLSCYHRQMYLESATVLRTAFGHTKGLKLKLFTEGSGRALEPRR 228
Query: 233 ----QHIPFGASPQDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRG 288
+ + FG S +DV T LG P I K D+M IHS+S R + D F+NY+T G
Sbjct: 229 QCFTRELLFGDSCEDVATSLGAPNRIFFKSEDKMKIHSSSVNRQAQSKRSDIFFNYFTLG 288
Query: 289 LDILFDGQTHKIKKFIMHTNYPGHADFNSYIKCNFIIL-----------GSDCTSAEVHS 337
+D+LFD +T KKFI+HTNYPGH +FN Y +C F L G D +
Sbjct: 289 IDVLFDARTQTCKKFILHTNYPGHFNFNMYHRCEFQFLLQADHPSMSDSGHDLVTPTKQE 348
Query: 338 YKNKITPNTKWEQVKEILGDCGRAAIQTQGS---TSNPFGSTFVYGYQNIAFEVMKNGYI 394
+ N IT TKW+ + L R + + S T+NPFGSTF YGYQ++ FEVM N +I
Sbjct: 349 HVN-ITAYTKWDAISSALATSERPVVLHRASSTNTANPFGSTFCYGYQDLIFEVMPNSHI 407
Query: 395 STVTMFQS 402
++VT++ +
Sbjct: 408 ASVTLYNT 415
>sp|Q9BSU1|CP070_HUMAN UPF0183 protein C16orf70 OS=Homo sapiens GN=C16orf70 PE=1 SV=1
Length = 422
Score = 214 bits (545), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 135/411 (32%), Positives = 207/411 (50%), Gaps = 64/411 (15%)
Query: 32 FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRL 91
F+LGMP+ +A A +++ I V V Y ++ PL D+I++ G L FD ++QRL++
Sbjct: 19 FTLGMPLAQAVAILQKHCRIIKNVQVLYSEQSPLSHDLILNLTQDGIKLMFDAFNQRLKV 78
Query: 92 IEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAF 151
IE+ D+ +++++Y + T + FG T PGVY+ ++ L + GLSF+F
Sbjct: 79 IEVCDLTKVKLKYCGVHFNSQAIAPTIEQIDQSFGATHPGVYNSAEQLFHLNFRGLSFSF 138
Query: 152 PIPAQYADCCQDREAELP------------LEFPDGTTPVTCRVSIYDGSADKKVGVGSL 199
+ D E P L+ P G T R+ IY G++
Sbjct: 139 QL---------DSWTEAPKYEPNFAHGLASLQIPHGAT--VKRMYIYSGNS--------- 178
Query: 200 FDKAIAPSLPV----GSLYIEEVHA---------------------KLGEELHFTVGSQH 234
AP +P+ G++Y E V L + V +
Sbjct: 179 LQDTKAPMMPLSCFLGNVYAESVDVLRDGTGPAGLRLRLLAAGCGPGLLADAKMRVFERS 238
Query: 235 IPFGASPQDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFD 294
+ FG S QDV + LG P + K D+M IHS S + + C DYF+NY+T G+DILFD
Sbjct: 239 VYFGDSCQDVLSMLGSPHKVFYKSEDKMKIHSPSPHKQVPSKCNDYFFNYFTLGVDILFD 298
Query: 295 GQTHKIKKFIMHTNYPGHADFNSYIKCNFII-LGSDCTSAEVHSYKNKITPNTKWEQVKE 353
THK+KKF++HTNYPGH +FN Y +C F I L +A+ + T +KW+ ++E
Sbjct: 299 ANTHKVKKFVLHTNYPGHYNFNIYHRCEFKIPLAIKKENADGQT--ETCTTYSKWDNIQE 356
Query: 354 ILGDCGRAAIQTQGSTS----NPFGSTFVYGYQNIAFEVMKNGYISTVTMF 400
+LG + S+S NPFGSTF +G Q + FEVM+N +I++VT++
Sbjct: 357 LLGHPVEKPVVLHRSSSPNNTNPFGSTFCFGLQRMIFEVMQNNHIASVTLY 407
>sp|O08654|CP070_RAT UPF0183 protein C16orf70 homolog OS=Rattus norvegicus PE=2 SV=1
Length = 422
Score = 213 bits (541), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 139/411 (33%), Positives = 210/411 (51%), Gaps = 64/411 (15%)
Query: 32 FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRL 91
F+LGMP+ +A A +++ I V V Y ++ PL D+I++ G L FD ++QRL++
Sbjct: 19 FTLGMPLAQAVAILQKHCRIIKNVQVLYSEQSPLSHDLILNLTQDGIKLLFDAFNQRLKV 78
Query: 92 IEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAF 151
IE++D+ +++++Y + T + FG T PGVY+ ++ L + GLSF+F
Sbjct: 79 IEVYDLTKVKLKYCGVHFNSQAIAPTIEQIDQSFGATHPGVYNSAEQLFHLNFRGLSFSF 138
Query: 152 PIPAQYADCCQDREAELP------------LEFPDGTTPVTCRVSIYDGSADKKVGVGSL 199
+ D E P L+ P G T R+ IY G+ SL
Sbjct: 139 QL---------DSWTEAPKYEPNFAHGLASLQIPHGAT--VKRMYIYSGN--------SL 179
Query: 200 FDKAIAPSLPV----GSLYIEEVH-----------------AKLG----EELHFTVGSQH 234
D AP +P+ G++Y E V A G + V +
Sbjct: 180 QDTK-APMMPLSCFLGNVYAESVDVIRDGTGPSGLRLRLLAAGCGPGVLADAKMRVFERA 238
Query: 235 IPFGASPQDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFD 294
+ FG S QDV + LG P + K D+M IHS S + + C DYF+NY+T G+DILFD
Sbjct: 239 VYFGDSCQDVLSMLGSPHKVFYKSEDKMKIHSPSPHKQVPSKCNDYFFNYFTLGVDILFD 298
Query: 295 GQTHKIKKFIMHTNYPGHADFNSYIKCNFIILGSDCTSAEVHSYKNKI-TPNTKWEQVKE 353
THK+KKF++HTNYPGH +FN Y +C F I E + +I T +KW+ ++E
Sbjct: 299 ANTHKVKKFVLHTNYPGHYNFNIYHRCEFKI--PLAIKKENAGGQTEICTTYSKWDSIQE 356
Query: 354 ILGDCGRAAIQTQGSTS----NPFGSTFVYGYQNIAFEVMKNGYISTVTMF 400
+LG + S+S NPFGSTF +G Q + FEVM+N +I++VT++
Sbjct: 357 LLGHPVEKPVVLHRSSSPNNTNPFGSTFCFGLQRMIFEVMQNNHIASVTLY 407
>sp|Q922R1|CP070_MOUSE UPF0183 protein C16orf70 homolog OS=Mus musculus PE=2 SV=2
Length = 422
Score = 209 bits (531), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 136/402 (33%), Positives = 207/402 (51%), Gaps = 46/402 (11%)
Query: 32 FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHLRFDPWSQRLRL 91
F+LGMP+ +A A +++ I V V Y ++ PL D+I++ G L FD ++QRL++
Sbjct: 19 FTLGMPLAQAVAILQKHCRIIRNVQVLYSEQSPLSHDLILNLTQDGITLLFDAFNQRLKV 78
Query: 92 IEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVYMLFYPGLSFAF 151
IE+ ++ +++++Y + T + FG T PGVY+ ++ L + GLSF+F
Sbjct: 79 IEVCELTKVKLKYCGVHFNSQAIAPTIEQIDQSFGATHPGVYNSTEQLFHLNFRGLSFSF 138
Query: 152 PIPAQYADCCQDREAELP------------LEFPDGTTPVTCRVSIYDGSA--DKKVGVG 197
+ D E P L+ P G T R+ IY G++ D K V
Sbjct: 139 QL---------DSWTEAPKYEPNFAHGLASLQIPHGAT--VKRMYIYSGNSLQDTKAPVM 187
Query: 198 SL---FDKAIAPSLPV-------GSLYIEEVHAKLG----EELHFTVGSQHIPFGASPQD 243
L A S+ V L + + A G + V + + FG S QD
Sbjct: 188 PLSCFLGNVYAESVDVLRDGTGPSGLRLRLLAAGCGPGVLADAKMRVFERAVYFGDSCQD 247
Query: 244 VWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCGDYFYNYYTRGLDILFDGQTHKIKKF 303
V + LG P + K D+M IHS S + + C DYF+NY+T G+DILFD THK+KKF
Sbjct: 248 VLSMLGSPHKVFYKSEDKMKIHSPSPHKQVPSKCNDYFFNYFTLGVDILFDANTHKVKKF 307
Query: 304 IMHTNYPGHADFNSYIKCNFIILGSDCTSAEVHSYKNKI-TPNTKWEQVKEILGDCGRAA 362
++HTNYPGH +FN Y +C F I E + +I T +KW+ ++E+LG
Sbjct: 308 VLHTNYPGHYNFNIYHRCEFKI--PLAIKKENAGGQTEICTTYSKWDSIQELLGHPVEKP 365
Query: 363 IQTQGSTS----NPFGSTFVYGYQNIAFEVMKNGYISTVTMF 400
+ S+S NPFGSTF +G Q + FEVM+N +I++VT++
Sbjct: 366 VVLHRSSSPNNTNPFGSTFCFGLQRMIFEVMQNNHIASVTLY 407
>sp|P34692|U183_CAEEL UPF0183 protein T01G9.2 OS=Caenorhabditis elegans GN=T01G9.2 PE=1
SV=3
Length = 422
Score = 172 bits (436), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 123/404 (30%), Positives = 198/404 (49%), Gaps = 41/404 (10%)
Query: 25 PGVGIGP----FSLGMPICEAFASIEQQPNIYDVVHVKYFDEEPLKLDIIISFPDHGFHL 80
P VG+ F LGMPI + A I+Q P + V +KY ++P DIII G L
Sbjct: 26 PDVGLKSSQFEFVLGMPINQCIAMIQQHPRMLTKVELKYSKKDPFYQDIIIYIGSTGIRL 85
Query: 81 RFDPWSQRLRLIEIFDIKRLQMRYATSLIGGSSTLATFVAVYALFGPTFPGVYDKERSVY 140
FD SQ ++LIE+ ++ + + Y ++ + +AT V FG T PG YD + ++Y
Sbjct: 86 YFDGLSQLIKLIEVDNLSMITLTYNDTIFSDPNNMATLDRVNEFFGSTHPGSYDDKHNIY 145
Query: 141 MLFYPGLSFAFPIPAQYADCCQDREA----ELPLEFPDGTTPVTCRVSIYDG-SADKKVG 195
+ +PGLSF FP + ++ + R L++ + P ++SIY G + +
Sbjct: 146 VQSWPGLSFCFPYGGENSNL-EVRPGFGGNLRSLKYDANSQPKLTKMSIYRGPNPSEPES 204
Query: 196 VGSLFDKAIAPSLPVGSLYIEEVHAKLGEELHF--------------TVGSQHIPFGASP 241
V + F + I E +G ++ F + ++ I FG S
Sbjct: 205 VDTPFSCYCGQNRTRKVEAIWENGNIVGIDIQFDTQNGRIVDGEYDVSTYTRQIYFGDSV 264
Query: 242 QDVWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCG--DYFYNYYTRGLDILFDGQTHK 299
DV + LG P + K D+M IH + TL G ++F+NY+ GLDILFD + +
Sbjct: 265 SDVQSILGAPTKVFYKSDDKMKIHRGLH---KETLYGPPNFFFNYFVMGLDILFDFVSKR 321
Query: 300 IKKFIMHTNYPGHADFNSYIKCNFIILGSDCTSAEVHSYKNKITPNTKWEQVKE-ILGDC 358
+ KF++HTN PGH DF Y +CNF I +D + +I ++K+++ + D
Sbjct: 322 VVKFVLHTNAPGHCDFGMYSRCNFSIFLND--------KQYEIRTDSKFDEFSHAFMNDS 373
Query: 359 G--RAAIQTQGSTSNPFGSTFVYGYQNIAFEVMKNGYISTVTMF 400
R + + PFGSTF YG + I E +NG++++VT++
Sbjct: 374 NPPRPVVLAR-QEQQPFGSTFCYGIKQIIVERTENGFLTSVTIY 416
>sp|P15238|RPC_BP163 Repressor protein C OS=Rhizobium phage 16-3 GN=C PE=1 SV=4
Length = 263
Score = 34.3 bits (77), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 244 VWTELGRPCGIHQKQVDQMVIHSASDPRPRSTLCG 278
           VW EL R  GI ++++ QM+  + DP   ++L G
Sbjct: 53  VWRELARELGIDEQEMRQMMTEAGRDPEKVTSLAG 87
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.140 0.436
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 163,584,553
Number of Sequences: 539616
Number of extensions: 7455045
Number of successful extensions: 13844
Number of sequences better than 100.0: 7
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 13820
Number of HSP's gapped (non-prelim): 8
length of query: 402
length of database: 191,569,459
effective HSP length: 120
effective length of query: 282
effective length of database: 126,815,539
effective search space: 35761981998
effective search space used: 35761981998
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 62 (28.5 bits)
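
The length and search-space figures in the statistics above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the BLAST output: the function names are hypothetical, the numbers are copied from this report, and it assumes that the second Lambda/K row (0.267, 0.0410) holds the gapped parameters used for the reported bit scores. Because Lambda and K are printed to only a few digits and compositional matrix adjustment is applied per hit, the recomputed bit scores should be read as approximations of the reported ones.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LENGTH = 402
DB_LETTERS = 191_569_459
DB_SEQUENCES = 539_616
EFFECTIVE_HSP_LENGTH = 120

# Assumption: the second Lambda/K/H row holds the gapped parameters.
GAPPED_LAMBDA = 0.267
GAPPED_K = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space).

    The effective HSP length is subtracted once from the query and once
    per database sequence; the search space is the product of the two
    effective lengths.
    """
    eff_query = query_len - hsp_len
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


if __name__ == "__main__":
    eff_q, eff_db, space = effective_search_space(
        QUERY_LENGTH, DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH
    )
    print("effective length of query:   ", eff_q)
    print("effective length of database:", eff_db)
    print("effective search space:      ", space)
    # Raw scores 607 and 77 are taken from the Score lines of two hits above.
    for raw in (607, 77):
        print(f"raw score {raw} is roughly {bit_score(raw, GAPPED_LAMBDA, GAPPED_K):.1f} bits")
```

The three values returned by effective_search_space can be compared directly with the "effective length of query", "effective length of database" and "effective search space" lines of the report.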