BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047048
(240 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q682V0|XRCC4_ARATH DNA repair protein XRCC4 OS=Arabidopsis thaliana GN=XRCC4 PE=1 SV=2
Length = 264
Score = 296 bits (759), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 154/254 (60%), Positives = 187/254 (73%), Gaps = 30/254 (11%)
Query: 3 TRQTCLKLEIPHATDPIFVKGTWFQSHFDLSITDGLNAWLCNASEEQVKDRAALWDQPVT 62
T+ TCL+LEI A DPIFVKGTW S FD+S+TDG ++W+CNA+EE+V +RAA WDQPV+
Sbjct: 23 TKHTCLRLEISGA-DPIFVKGTWHNSRFDISVTDGSSSWICNATEEEVAERAAQWDQPVS 81
Query: 63 EYIKLAERYLGFQQPGCVYGFTDAADGCKRLSWTFEKEGTKLEWRWKCGPAPNSKKVPAG 122
EY+KLAE+YLGFQQP VY F+DA +G KRLSWTFEKEGTKLEWRWKC P+ +SKK+ G
Sbjct: 82 EYLKLAEQYLGFQQPNSVYSFSDALEGSKRLSWTFEKEGTKLEWRWKCKPSDDSKKITVG 141
Query: 123 ILDFLMDANIRLS----------------AEKCLAQSERFKNEKLEFESAIYAKFLGVLN 166
ILDFLM+ANIRLS AE+CLAQ E+ +EK EFESA YAKFL VLN
Sbjct: 142 ILDFLMEANIRLSEEVVNKTRSFEKMRSEAERCLAQGEKLCDEKTEFESATYAKFLSVLN 201
Query: 167 SKKSKLRELQDKLSKQAVMGDLQQEEEESTDETESFDEG-TDDDKSEEEPAKDITSTSIG 225
+KK+KLR L+DK V+ EEEESTD+ ESF+ G +DD+KSEEE +K TS
Sbjct: 202 AKKAKLRALRDKEDSVRVV-----EEEESTDKAESFESGRSDDEKSEEEASKKATS---- 252
Query: 226 VPASRARGRKRISR 239
S+ARG KR +R
Sbjct: 253 ---SKARGGKRAAR 263
>sp|Q13426|XRCC4_HUMAN DNA repair protein XRCC4 OS=Homo sapiens GN=XRCC4 PE=1 SV=2
Length = 336
Score = 53.1 bits (126), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 57/237 (24%), Positives = 103/237 (43%), Gaps = 34/237 (14%)
Query: 20 FVKGTW---FQSHFDLSITDGLNAWLCNASEEQVKDRAALWDQPVTEYI-KLAERYLGFQ 75
F++ +W +S F +++TDG +AW SE ++ A +Y+ +L + L
Sbjct: 19 FLQVSWEKTLESGFVITLTDGHSAWTGTVSESEISQEADDMAMEKGKYVGELRKALLSGA 78
Query: 76 QPGCVYGFTDAADGCKRLSWTFEKEGTKLEWR---WKCGPAPNSKKVPAGILDFLMDANI 132
P VY F + + C + FEK + +R + N +V ++ + +D
Sbjct: 79 GPADVYTFNFSKESC---YFFFEKNLKDVSFRLGSFNLEKVENPAEVIRELICYCLDTIA 135
Query: 133 RLSAEKCLAQSE-------------RFK---NEKLEFESAIYAKFLGVLNSKKSKLRELQ 176
A+ Q E RF+ + K E+ +Y +F+ VLN KK+K+R L
Sbjct: 136 ENQAKNEHLQKENERLLRDWNDVQGRFEKCVSAKEALETDLYKRFILVLNEKKTKIRSLH 195
Query: 177 DKL--SKQAVMGDLQQE------EEESTDETESFDEGTDDDKSEEEPAKDITSTSIG 225
+KL + Q D++QE E + D +DE TD++ + + S ++
Sbjct: 196 NKLLNAAQEREKDIKQEGETAICSEMTADRDPVYDESTDEESENQTDLSGLASAAVS 252
>sp|Q924T3|XRCC4_MOUSE DNA repair protein XRCC4 OS=Mus musculus GN=Xrcc4 PE=1 SV=1
Length = 326
Score = 41.6 bits (96), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 51/210 (24%), Positives = 89/210 (42%), Gaps = 38/210 (18%)
Query: 18 PIFVKGTW---FQSHFDLSITDGLNAWLCNASEEQVKDRAALWDQPVTEYIKLAERYLGF 74
P F++ +W S F +++TDG +AW SE ++ A +YI + L
Sbjct: 17 PYFLQVSWERAIGSGFVITLTDGHSAWTATVSELEISQEADDMAMEKGKYIDELRKALV- 75
Query: 75 QQPGC----VYGFTDAADGCKRLSWTFEKEGTKLEWR---WKCGPAPNSKKVPAGILDFL 127
PG Y F + + ++ EKE + +R + NS +V ++ +
Sbjct: 76 --PGSGAAGTYKFLFSKESQH---FSLEKELKDVSFRLGSFNLDKVSNSAEVIRELICYC 130
Query: 128 MDANIRLSAEKCLAQSE-------------RFK---NEKLEFESAIYAKFLGVLNSKKSK 171
+D A+ Q E RF+ + K E+ +Y +F+ VLN KK+K
Sbjct: 131 LDTITEKQAKNEHLQKENERLLRDWNDVQGRFEKCVSAKEALEADLYQRFILVLNEKKTK 190
Query: 172 LRELQDKLSKQAVMGDLQQEEEESTDETES 201
+R L ++ ++QQ EE + E E+
Sbjct: 191 IRSL------HKLLNEVQQLEESTKPEREN 214
>sp|P04583|ENV_HV1MA Envelope glycoprotein gp160 OS=Human immunodeficiency virus type 1
group M subtype A (isolate MAL) GN=env PE=1 SV=1
Length = 859
Score = 32.3 bits (72), Expect = 3.2, Method: Composition-based stats.
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 33 SITDGLNAWLCNASEEQVKDRAALWDQPVTEYIKL 67
++T+G N W N E+ +D +LWDQ + +KL
Sbjct: 87 NVTEGFNMWKNNMVEQMHEDIISLWDQSLKPCVKL 121
>sp|Q46DH6|KTHY_METBF Probable thymidylate kinase OS=Methanosarcina barkeri (strain
Fusaro / DSM 804) GN=tmk PE=3 SV=1
Length = 205
Score = 32.0 bits (71), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 17/39 (43%), Positives = 22/39 (56%)
Query: 124 LDFLMDANIRLSAEKCLAQSERFKNEKLEFESAIYAKFL 162
L FL D +S E+C + E+ K EKLEF + A FL
Sbjct: 130 LTFLFDIRPEISIERCGKRGEQSKFEKLEFLQGVRAIFL 168
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.131 0.390
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 91,226,527
Number of Sequences: 539616
Number of extensions: 3819606
Number of successful extensions: 21040
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 141
Number of HSP's successfully gapped in prelim test: 194
Number of HSP's that attempted gapping in prelim test: 19191
Number of HSP's gapped (non-prelim): 1429
length of query: 240
length of database: 191,569,459
effective HSP length: 114
effective length of query: 126
effective length of database: 130,053,235
effective search space: 16386707610
effective search space used: 16386707610
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 59 (27.3 bits)