BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 018777
(350 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q9SE51|SURF1_ARATH Surfeit locus protein 1 OS=Arabidopsis thaliana GN=SURF1 PE=2 SV=1
Length = 354
Score = 394 bits (1011), Expect = e-109, Method: Compositional matrix adjust.
Identities = 188/298 (63%), Positives = 238/298 (79%), Gaps = 8/298 (2%)
Query: 55 QDQENVRKGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL 114
QEN R S WS+ LLFLPGAI+FGLG+WQI RR++K K LEY+Q RL M+P++L
Sbjct: 63 PPQENKR-----GSKWSQLLLFLPGAITFGLGSWQIVRREEKFKTLEYQQQRLNMEPIKL 117
Query: 115 NITSPLTEDLKSLEFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNP 174
NI PL ++L +LEFRRV C+GVFDEQRSIY+GPRSRSISG+TENG++VITPLMPIP +
Sbjct: 118 NIDHPLDKNLNALEFRRVSCKGVFDEQRSIYLGPRSRSISGITENGFFVITPLMPIPGDL 177
Query: 175 QSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQ--QSQQSSWWWFWLKKPNIV 232
S++SP+LVNRGWVPRSWR+KS E S ++E N + + ++ SWW FW K P I
Sbjct: 178 DSMQSPILVNRGWVPRSWREKSQE-SAEAEFIANQSTKAKSPSNEPKSWWKFWSKTPVIT 236
Query: 233 EDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDT 292
++ + ++ VEVVGV+RG E PSIFVP+NDPS+ QWFYVDVPA+A A GLPENT+Y+ED
Sbjct: 237 KEHISAVKPVEVVGVIRGGENPSIFVPSNDPSTGQWFYVDVPAMARAVGLPENTIYVEDV 296
Query: 293 NENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRPKKSRR 350
+E+V+ S PYP+PKD++TL+RS VMPQDHLNY++TWYSLSAAVTFMA+KRL+ K RR
Sbjct: 297 HEHVDRSRPYPVPKDINTLIRSKVMPQDHLNYSITWYSLSAAVTFMAYKRLKAKPVRR 354
>sp|Q9LP74|SURFL_ARATH Surfeit locus protein 1-like OS=Arabidopsis thaliana GN=At1g48510
PE=2 SV=2
Length = 384
Score = 239 bits (609), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 189/313 (60%), Gaps = 26/313 (8%)
Query: 34 RLYSSSAAAALSSAPQLSSSSQ--DQENVRKGSAP----SSTWSKWLLFLPGAISFGLGT 87
RL S S + S+ L ++SQ + E+ SAP S L +L G ++GLG
Sbjct: 11 RLISQSQYMSSSTTSNLPAASQTSNLESQLLSSAPPPAKKKRGSALLWYLVGFTTYGLGE 70
Query: 88 WQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVFDEQRSIYVG 147
F Q +++ L+ R+ L+M P++LN T +DL L FRRV+C+G+FDEQRSIYVG
Sbjct: 71 TYKFL-QTQVEHLDSRKQCLEMKPMKLNTT----KDLDGLGFRRVVCKGIFDEQRSIYVG 125
Query: 148 PRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPL 207
P+ RS+S +E G+YVITPL+PIPN P S+KSP+LVNRGWVP W++ S E S
Sbjct: 126 PKPRSMSKSSEIGFYVITPLLPIPNEPNSMKSPILVNRGWVPSDWKENSLE----SLGTG 181
Query: 208 NLAPSVQQSQQSS-----------WWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSI 256
L + ++S++++ +W+ L P IVED V VEVVGVVR SE P I
Sbjct: 182 GLVAAAKESRKANKLLSSQQSLLSKFWYKLNNPMIVEDQVSRAMHVEVVGVVRKSETPGI 241
Query: 257 FVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKDVSTLLRSSV 316
+ N PSS WFY+DVP +A A G E+T+YIE T +++ S YP+P+DV L RS
Sbjct: 242 YTLVNYPSSLAWFYLDVPKLALAMGFGEDTMYIESTYTDMDESRTYPVPRDVENLTRSKD 301
Query: 317 MPQDHLNYTLTWY 329
+P D+ YT+ W+
Sbjct: 302 IPLDYHLYTVLWH 314
>sp|P09925|SURF1_MOUSE Surfeit locus protein 1 OS=Mus musculus GN=Surf1 PE=2 SV=3
Length = 306
Score = 115 bits (289), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 93/352 (26%), Positives = 161/352 (45%), Gaps = 72/352 (20%)
Query: 7 MAVASISKTLTK-----LGGGSSFLLNHRAPPRLYSSSAAAALSSAPQLSSSSQDQENVR 61
MA+A + + +T+ G + F R+ ++ S + + P+ SS +
Sbjct: 5 MALAVLPRRMTRWSQWAYAGRAQFCAVRRS---VFGFSVRSGMVCRPRRCCSSTAETA-- 59
Query: 62 KGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLT 121
A ++ +W L L A +FGLGTWQ+ RR+ K+K++ ++R+ +P+ L P+
Sbjct: 60 AAKAEDDSFLQWFLLLIPATAFGLGTWQVQRRKWKLKLIAELESRVMAEPIPLP-ADPM- 117
Query: 122 EDLKSLEFRRVICQGVFDEQRSIYVGPRS--------RSISGV--TENGYYVITPLMPIP 171
+LK+LE+R V +G FD + +Y+ PR+ R + TE+G +V+TP
Sbjct: 118 -ELKNLEYRPVKVRGHFDHSKELYIMPRTMVDPVREARDAGRLSSTESGAHVVTPF---- 172
Query: 172 NNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNI 231
+ + +LVNRG+VPR + P +Q Q
Sbjct: 173 -HCSDLGVTILVNRGFVPRK----------------KVNPETRQKGQ------------- 202
Query: 232 VEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIED 291
+ V++VG+VR +E FVP N P W+Y D+ A+A G + ++I+
Sbjct: 203 ------VLGEVDLVGIVRLTENRKPFVPENSPERNHWYYRDLEAMAKITG--ADPIFIDA 254
Query: 292 TNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
+ P P+ LR+ +H+ Y LTWY L AA +++ F++
Sbjct: 255 DFHSTAPGG--PIGGQTRVTLRN-----EHMQYILTWYGLCAATSYLWFQKF 299
>sp|Q15526|SURF1_HUMAN Surfeit locus protein 1 OS=Homo sapiens GN=SURF1 PE=1 SV=1
Length = 300
Score = 115 bits (287), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 87/307 (28%), Positives = 145/307 (47%), Gaps = 65/307 (21%)
Query: 48 PQLSSSSQDQENVRKGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRL 107
P SS + + K A ++ +W+L L +FGLGTWQ+ RR+ K+ ++ ++R+
Sbjct: 41 PSRCGSSAAEASATK--AEDDSFLQWVLLLIPVTAFGLGTWQVQRRKWKLNLIAELESRV 98
Query: 108 QMDPLRLNITSPLTEDLKSLEFRRVICQGVFDEQRSIYVGPRSRS-----------ISGV 156
+P+ L P+ +LK+LE+R V +G FD + +Y+ PR+ IS
Sbjct: 99 LAEPVPLP-ADPM--ELKNLEYRPVKVRGCFDHSKELYMMPRTMVDPVREAREGGLISSS 155
Query: 157 TENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQS 216
T++G YV+TP + + +LVNRG+VPR + P +Q
Sbjct: 156 TQSGAYVVTPF-----HCTDLGVTILVNRGFVPRK----------------KVNPETRQK 194
Query: 217 QQSSWWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAI 276
Q +E + V+++G+VR +E FVP N+P W Y D+ A+
Sbjct: 195 GQ-------------IEGE------VDLIGMVRLTETRQPFVPENNPERNHWHYRDLEAM 235
Query: 277 ACACGLPENTVYIEDTNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVT 336
A G ++I+ ++ P P+ LR+ +HL Y +TWY LSAA +
Sbjct: 236 ARITG--AEPIFIDANFQSTVPGG--PIGGQTRVTLRN-----EHLQYIVTWYGLSAATS 286
Query: 337 FMAFKRL 343
++ FK+
Sbjct: 287 YLWFKKF 293
>sp|Q9QXU2|SURF1_RAT Surfeit locus protein 1 OS=Rattus norvegicus GN=Surf1 PE=2 SV=1
Length = 306
Score = 114 bits (285), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/281 (29%), Positives = 137/281 (48%), Gaps = 63/281 (22%)
Query: 73 WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRV 132
+LLF+P A +FGLGTWQ+ RR+ K+K++ ++R+ +P+ L P+ +LK+LE+R V
Sbjct: 72 FLLFIP-ATAFGLGTWQVQRRKWKLKLIAELESRVMAEPIPLP-ADPM--ELKNLEYRPV 127
Query: 133 ICQGVFDEQRSIYVGPRS--------RSISGV--TENGYYVITPLMPIPNNPQSVKSPVL 182
+G FD + +Y+ PR+ R + TE+G YV+TP + + +L
Sbjct: 128 KVRGHFDHSKELYIMPRTMVDPVREARDAGRLSSTESGAYVVTPF-----HCSDLGVTIL 182
Query: 183 VNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASV 242
VNRG+VPR + P +Q Q + V
Sbjct: 183 VNRGFVPRK----------------KVNPETRQQGQ-------------------VLGEV 207
Query: 243 EVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPY 302
++VG+VR +E FVP N+P W+Y D+ A+A G + ++I+ + P
Sbjct: 208 DLVGIVRLTENRKPFVPENNPERSLWYYRDLDAMAKRTG--TDPIFIDADFNSTTPGG-- 263
Query: 303 PLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
P+ LR+ +H+ Y +TWY L AA +++ F++
Sbjct: 264 PIGGQTRVTLRN-----EHMQYIITWYGLCAATSYLWFRKF 299
>sp|A4IHH4|SURF1_XENTR Surfeit locus protein 1 OS=Xenopus tropicalis GN=surf1 PE=2 SV=1
Length = 307
Score = 109 bits (273), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 96/353 (27%), Positives = 150/353 (42%), Gaps = 68/353 (19%)
Query: 7 MAVASISKTLTKLGGGSSFL-----LNHRAPPRLYSSSAAAALSSAPQLSSSSQDQENVR 61
MA+ ++K L G + L L+H A P + S A L + ++
Sbjct: 1 MALPGVTKLLLLPGVRAQLLNTPVRLSHWATPGRCTKSCHAYLQKNLRFCTTRSFSSVSP 60
Query: 62 KGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLT 121
+ T KWLL L +F LGTWQ+ RR K+K+++ + R+ P+ L T P+
Sbjct: 61 AAESSEDTVLKWLLLLIPVATFSLGTWQVQRRSWKLKLIQEMEARVSGKPIPLT-TDPM- 118
Query: 122 EDLKSLEFRRVICQGVFDEQRSIYVGPRS-----------RSISGVTENGYYVITPLMPI 170
++K LE+R V +G FD + +Y+ PR+ ++ T++G VITP
Sbjct: 119 -EIKELEYRPVKVRGHFDHSKELYILPRTLVDPEREAREAGQLASNTQSGAQVITPFY-- 175
Query: 171 PNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPN 230
+ +LVNRG+VP+ + P + Q S
Sbjct: 176 ---CSDLGITILVNRGFVPKK----------------KVNPETRPKGQVS---------- 206
Query: 231 IVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIE 290
VE+VG+VR +E FVP NDPS W Y D+ A+A G + I+
Sbjct: 207 ---------GEVELVGIVRLNETRKPFVPHNDPSRNLWHYKDLSAMAQVVG--AEPILID 255
Query: 291 DTNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
+ P P+ LR+ +H+ Y +TWY L AA T++ K+
Sbjct: 256 ADRGSTVPGG--PIGGQTRVTLRN-----EHMQYIVTWYGLCAATTYLWCKKF 301
>sp|Q800L1|SURF1_CHICK Surfeit locus protein 1 OS=Gallus gallus GN=SURF1 PE=3 SV=1
Length = 309
Score = 108 bits (271), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 131/291 (45%), Gaps = 63/291 (21%)
Query: 64 SAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTED 123
+A W KW L L +F LGTWQI RR+ K+ ++ +RL +P+ L + P+ +
Sbjct: 65 AAGEDAWLKWGLLLVPLTAFCLGTWQIQRRKWKLDLIAQLASRLSSEPIPLTL-DPM--E 121
Query: 124 LKSLEFRRVICQGVFDEQRSIYVGPRS--------RSISGVT---ENGYYVITPLMPIPN 172
LK LE+R V +G FD + +Y+ PRS R +T ENG VITP
Sbjct: 122 LKELEYRPVKVRGHFDHSKELYILPRSLVDPEREAREAGKLTSHAENGANVITPFYCT-- 179
Query: 173 NPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIV 232
+ +LVNRG+VP+ L P + Q +
Sbjct: 180 ---ELGVTILVNRGFVPKK----------------KLKPETRLKGQ-------------I 207
Query: 233 EDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDT 292
E++ +++ GVVR SEK FVP N+ +W Y D+ A+A G ++I+
Sbjct: 208 EEE------IDLTGVVRLSEKRKPFVPENNIEKNRWHYRDLEAMAKVTG--AEPIFIDAD 259
Query: 293 NENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
+ P P VS + +H+ Y +TWY L AA +F+ +++
Sbjct: 260 FRSTVPGGPIGGQTRVS-------LRNEHMQYIVTWYGLCAATSFLWYRKF 303
>sp|O57593|SURF1_TAKRU Surfeit locus protein 1 (Fragment) OS=Takifugu rubripes GN=surf1
PE=3 SV=1
Length = 240
Score = 105 bits (263), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 122/286 (42%), Gaps = 63/286 (22%)
Query: 72 KWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRR 131
KW L L A +FGLGTWQ+ RRQ K+++++ +P+ L I +L SLE+RR
Sbjct: 4 KWFLLLIPATTFGLGTWQVKRRQWKMELIDGLTKLTTAEPIPLPIDPA---ELSSLEYRR 60
Query: 132 VICQGVFDEQRSIYVGPRS-----------RSISGVTENGYYVITPLMPIPNNPQSVKSP 180
V +G +D + +Y+ PRS +S E G VITP + +
Sbjct: 61 VKMRGKYDHSKELYILPRSPVDPEKEAREAGRLSSSGETGANVITPF-----HVTDLGIT 115
Query: 181 VLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIA 240
+LVNRG+VP+ + P + Q
Sbjct: 116 ILVNRGYVPKK----------------KIRPETRMKGQVE-------------------G 140
Query: 241 SVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSN 300
+EVVGVVR +E FVP ND W Y D+ A+ G ++++ + P
Sbjct: 141 EMEVVGVVRLTETRKPFVPNNDVERNHWHYRDLEAMCQVTG--AEPIFVDADFSSTVPGG 198
Query: 301 PYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRPK 346
P+ LR+ +H+ Y +TWY L AA ++M F + K
Sbjct: 199 --PIGGQTRVTLRN-----EHMQYIVTWYGLCAATSYMWFAKFIKK 237
>sp|A9UWF0|SURF1_MONBE SURF1-like protein OS=Monosiga brevicollis GN=18583 PE=3 SV=1
Length = 261
Score = 102 bits (254), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 84/285 (29%), Positives = 128/285 (44%), Gaps = 60/285 (21%)
Query: 73 WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRV 132
W L ++FGLGTWQIFR+Q K +++ + +L +P L T+P DL +E+ RV
Sbjct: 19 WALLSVPVVTFGLGTWQIFRKQQKEELIATLEAKLSKEPAALP-TNP--ADLAHMEYERV 75
Query: 133 ICQGVFDEQRSIYVGPRS------RSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRG 186
G F + + VGPR+ ++ + E G VITP +LVNRG
Sbjct: 76 AVTGTFLHDQEMLVGPRTVTREVFSGMADLPEAGVQVITPF-----RLADTGEVILVNRG 130
Query: 187 WVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVG 246
+VP +++ P P + + Q +V + G
Sbjct: 131 FVP------------EAQAP----PHKRAAGQVE-------------------GTVRLEG 155
Query: 247 VVRGSEKPSIFVPANDPSSCQWFYVDVPAIAC-ACGLPENTVYIEDTNENVNPSNPYPLP 305
+VR E + FVP N P W+++DV +A LP V I+ T E P +PL
Sbjct: 156 IVRHGESQTAFVPDNHPEQNTWYWIDVFTMASNRSALP---VLIDATAE-CTPPGGFPLG 211
Query: 306 KDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRPKKSRR 350
+ +R+ +HL+Y +TWYS+S A+T + LR K R
Sbjct: 212 GQTNITVRN-----EHLSYIITWYSIS-AITLAMWVFLRRKGGNR 250
>sp|Q556J9|SURF1_DICDI SURF1-like protein OS=Dictyostelium discoideum GN=surf1-1 PE=3 SV=2
Length = 270
Score = 102 bits (253), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 88/284 (30%), Positives = 139/284 (48%), Gaps = 40/284 (14%)
Query: 74 LLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL------NITSPLTEDLKSL 127
L F+ I+FGLGTWQ++R K ++++ ++R++ DP+ L N DL
Sbjct: 10 LFFIFPVIAFGLGTWQVYRYDWKKRLIQRAKDRMEEDPIELSNSFIKNFKGSSFGDLNKY 69
Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
EFRRV G + + + +GP RSI G GYYVI+PL S + +L+NRGW
Sbjct: 70 EFRRVYLNGKVIDNQYVLLGP--RSIDGTL--GYYVISPLQ------LSDGTRILLNRGW 119
Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIA--SVEVV 245
+ KS+ + + L L ++ Q + + SI ++
Sbjct: 120 SAST--PKSNYKIPYAIEELKLIHQKEKEQGQQ------------QGNQESILYRYFNIL 165
Query: 246 GVV-RGSEKPSIFVPANDPSSCQWFYVDVPAIACACG---LPENTVYIEDTNENVNPSN- 300
GV+ + E+ S F P N P QW+ +DV A+A L NT +++T N PS+
Sbjct: 166 GVISKTKERGSAFTPTNQPEKGQWYSLDVDAMADQLNTEPLMINT--MDETEINSKPSSL 223
Query: 301 PYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLR 344
P P K + SS + H++Y TWY+LSA++ F+ F+ +R
Sbjct: 224 PNPQFKRFDNDVESSFHNK-HMSYIGTWYTLSASLFFIYFRYMR 266
>sp|Q9Y810|SHY1_SCHPO Cytochrome oxidase assembly protein shy1 OS=Schizosaccharomyces
pombe (strain 972 / ATCC 24843) GN=shy1 PE=3 SV=1
Length = 290
Score = 80.9 bits (198), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 76/272 (27%), Positives = 123/272 (45%), Gaps = 62/272 (22%)
Query: 81 ISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVFDE 140
++F LGTWQ+ RR+ K+ ++ RLQ + L T +D K LE+ RV+ +GVF
Sbjct: 49 VTFALGTWQVKRREWKMGIINTLTERLQQPAILLPKTVT-EQDTKKLEWTRVLLRGVFCH 107
Query: 141 QRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVS 200
+ + VGPR++ + GY+V+TP I ++ + + LVNRGW+ RS+ ++SS
Sbjct: 108 DQEMLVGPRTKE----GQPGYHVVTPF--ILDDGRRI----LVNRGWIARSFAEQSS--- 154
Query: 201 RDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSIFVPA 260
RD P +L K P ++E G++R F+
Sbjct: 155 RD---PSSLP----------------KGPVVIE------------GLLRQHTDKPRFMMK 183
Query: 261 NDPSSCQWFYVDVPAIACACG-LPENTVYIE------DTNENVNPSNPYPLPKDVSTLLR 313
N+P +++++V A G LP ++ ++V P P V
Sbjct: 184 NEPEKNSFYFLNVREFAQLKGTLPILITELQPSLTPLQEADHVKRGLPLGHPLKVEIF-- 241
Query: 314 SSVMPQDHLNYTLTWYSL---SAAVTFMAFKR 342
H Y +TWYSL SA + ++ FKR
Sbjct: 242 -----NSHTEYIITWYSLSVVSAIMLYVYFKR 268
>sp|Q92GL0|SURF1_RICCN SURF1-like protein OS=Rickettsia conorii (strain ATCC VR-613 /
Malish 7) GN=RC1113 PE=3 SV=1
Length = 240
Score = 80.5 bits (197), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 68/277 (24%), Positives = 120/277 (43%), Gaps = 63/277 (22%)
Query: 71 SKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSP---LTEDLKSL 127
+ +L+F+ I LG WQ+ R ++K +L + ++ N+TSP L E L
Sbjct: 3 TNFLVFITFTILISLGFWQLSRLKEK---------KLFLASMQANLTSPAINLAEIQDGL 53
Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
+ +V G F + IY+ R RS+S ++GYY++TP I + +LV RGW
Sbjct: 54 PYHKVKITGQFLPNKDIYLYGR-RSMSS-EKDGYYLVTPFKTIED------KVILVARGW 105
Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGV 247
++ ++ + D + E++GV
Sbjct: 106 FSNRNKNIITQATNDRQH-------------------------------------EIIGV 128
Query: 248 VRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKD 307
SEK I++PAND + W +++ + GL YI ++++ + LP
Sbjct: 129 TMPSEKTRIYLPANDIKNNVWLTLNLKETSKVLGLDLENFYIIAEGKDISNLDIL-LPLA 187
Query: 308 VSTLLRSSVMPQDHLNYTLTWYSL--SAAVTFMAFKR 342
++ L + + DHL Y LTW+ L S V ++ ++R
Sbjct: 188 INHL---AAIRNDHLEYALTWFGLAISLIVIYVIYRR 221
>sp|A8Y2C9|SURF1_CAEBR SURF1-like protein OS=Caenorhabditis briggsae GN=sft-1 PE=3 SV=1
Length = 317
Score = 80.1 bits (196), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 123/294 (41%), Gaps = 70/294 (23%)
Query: 68 STWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL--NITSPLTEDLK 125
ST S +L LP A +F LG WQI+R K++++E+ ++RL + + L +++S L+
Sbjct: 76 STGSILMLGLP-AFAFSLGVWQIYRLIWKLELIEHLKSRLSQEAIELPDDLSS---SSLE 131
Query: 126 SLEFRRVICQGVFDEQRSIYVGPRSR---------------SISGVTENGYYVITPLMPI 170
LE+ RV G F Q+ + PR R S + ++ +G ++ITP
Sbjct: 132 PLEYCRVRVTGEFLHQKEFVISPRGRFDPAKKTSASVGSMLSENEMSSHGGHLITPF--- 188
Query: 171 PNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPN 230
++ +L+NRGW+P + D S + +
Sbjct: 189 --RLKNTGKVILINRGWLPTFYFDPESHAKTNPQ-------------------------- 220
Query: 231 IVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIE 290
+V + +VR +E+ FV N P W+Y D+ +A G V+++
Sbjct: 221 ---------GTVILEAIVRKTEQRPQFVGQNVPEQGVWYYRDLEQMAKWHG--TEPVWLD 269
Query: 291 DTNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLR 344
E P P +++ + +H+NY TW++L+ M + R
Sbjct: 270 AAYETTVPGGPIGGQTNIN-------VRNEHMNYLTTWFTLTLVTMLMWIHKFR 316
>sp|Q9U4F3|SURF1_DROME SURF1-like protein OS=Drosophila melanogaster GN=Surf1 PE=2 SV=1
Length = 300
Score = 79.3 bits (194), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 77/283 (27%), Positives = 120/283 (42%), Gaps = 65/283 (22%)
Query: 73 WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRV 132
W L L A +FGLG WQ+ R+ K ++++ +L P+ L LT DL +E+R V
Sbjct: 65 WFLLLIPATTFGLGCWQVKRKIWKEQLIKDLNKQLSTAPVAL--PDDLT-DLAQMEYRLV 121
Query: 133 ICQGVFDEQRSIYVGPRS-------RSISGV-----TENGYYVITPLMPIPNNPQSVKSP 180
+G F + + +GPRS + G+ + NGY ++TP +
Sbjct: 122 KIRGRFLHDKEMRLGPRSLIRPDGVETQGGLFSQRDSGNGYLIVTPFQLADRD-----DI 176
Query: 181 VLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIA 240
VLVNRGW VSR +P QQ A
Sbjct: 177 VLVNRGW-----------VSRKQVEPETRPLGQQQ------------------------A 201
Query: 241 SVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSN 300
VE+ VVR E F P D + Y D+ + A G V+++ + ++
Sbjct: 202 EVELTAVVRKGEARPQFTP--DHKGNVYLYRDLARMCAATG--AAPVFLDAVYDPQTAAH 257
Query: 301 PYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
P+ LR+ DHL+Y +TW+SLSAA +F+ ++++
Sbjct: 258 A-PIGGQTRVTLRN-----DHLSYLVTWFSLSAATSFLWYRQI 294
>sp|Q9N5N8|SURF1_CAEEL SURF1-like protein OS=Caenorhabditis elegans GN=sft-1 PE=3 SV=1
Length = 323
Score = 72.4 bits (176), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 69/293 (23%), Positives = 115/293 (39%), Gaps = 70/293 (23%)
Query: 69 TWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL--NITSPLTEDLKS 126
T S +L +P +F LG WQ FR + K+ ++E+ + RL L +++ E L+
Sbjct: 83 TGSVLMLTIP-VFAFSLGIWQTFRLKWKLDLIEHLKGRLNQTAQELPEDLSC---ESLEP 138
Query: 127 LEFRRVICQGVFDEQRSIYVGPRSRSISG---------------VTENGYYVITPLMPIP 171
LE+ RV G F ++ + PR R G ++ +G ++ITP
Sbjct: 139 LEYCRVTVTGEFLHEKEFIISPRGRFDPGKKTSAAAGSMLSENEMSSHGGHLITPF---- 194
Query: 172 NNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNI 231
++ +L+NRGW+P + D + + L L
Sbjct: 195 -RLKNSGKIILINRGWLPSFYFDPETRQKTNPRGTLTL---------------------- 231
Query: 232 VEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIED 291
P+I VR +EK FV N P W+Y D+ +A G V ++
Sbjct: 232 -----PAI--------VRKTEKRPQFVGQNVPEQGVWYYRDLNQMAKHYG--TEPVLLDA 276
Query: 292 TNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLR 344
E P P +++ + +HLNY TW++L+ M + R
Sbjct: 277 AYETTVPGGPIGGQTNIN-------VRNEHLNYLTTWFTLTLVTMLMWIHKFR 322
>sp|Q4UN32|SURF1_RICFE SURF1-like protein OS=Rickettsia felis (strain ATCC VR-1525 /
URRWXCal2) GN=RF_0175 PE=3 SV=1
Length = 226
Score = 71.2 bits (173), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 65/277 (23%), Positives = 114/277 (41%), Gaps = 63/277 (22%)
Query: 71 SKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSP---LTEDLKSL 127
+ ++ + I LG WQ+ R ++K +L + ++ N+TSP L E SL
Sbjct: 3 TNLVVLITFTILISLGFWQLSRLKEK---------KLFLASMQANLTSPAINLAEIQDSL 53
Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
+ +V G F + IY+ R SG ++GYY++TP I + +LV RGW
Sbjct: 54 PYHKVKITGQFLPNKDIYLYGRRSMSSG--KDGYYLVTPFKTIED------KVILVARGW 105
Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGV 247
+ ++ + D + E++GV
Sbjct: 106 FSNRNKIIITQATNDRQH-------------------------------------EIIGV 128
Query: 248 VRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKD 307
SEK ++PAND + W +D+ + L YI ++++ + LP
Sbjct: 129 TMPSEKTRSYLPANDIKNNVWLTLDLKEASQTLELNLEDFYIIAEGKDISNLDIL-LPLS 187
Query: 308 VSTLLRSSVMPQDHLNYTLTWYSL--SAAVTFMAFKR 342
++ L + + DHL Y LTW+ L S V ++ ++R
Sbjct: 188 INHL---AAIRNDHLEYALTWFGLAISLIVIYVIYRR 221
>sp|Q1RJM4|SURF1_RICBR SURF1-like protein OS=Rickettsia bellii (strain RML369-C)
GN=RBE_0359 PE=3 SV=1
Length = 241
Score = 70.1 bits (170), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 72/278 (25%), Positives = 118/278 (42%), Gaps = 65/278 (23%)
Query: 71 SKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLK---SL 127
+K + + I LG WQ+ R ++K +L + ++ N+TSP + K +L
Sbjct: 3 TKLTVLITFIILVLLGFWQLNRLKEK---------KLFLASMQENLTSPAIDLAKIQDNL 53
Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
+ +V G F + IY+ R RS+S ++GYY++TP +LV RGW
Sbjct: 54 PYHKVKITGHFLPDKDIYLYGR-RSMSS-EKDGYYLVTPF------KTDEDKIILVARGW 105
Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGV 247
S R+K+ ++QP E++GV
Sbjct: 106 F--SNRNKNIITQATNDQP-----------------------------------HELIGV 128
Query: 248 VRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLP-ENTVYIEDTNENVNPSNPYPLPK 306
SEK ++PAND + W +D+ + GL EN IE++ + N PL
Sbjct: 129 TMPSEKTRSYLPANDIKNNVWLTLDLQEASKVLGLNLENFYLIEESKDISNLDILLPLSI 188
Query: 307 DVSTLLRSSVMPQDHLNYTLTWYSLSAA--VTFMAFKR 342
+ +R+ DHL Y TW+ L+A+ V + +KR
Sbjct: 189 NHLAAIRN-----DHLEYAFTWFGLAASLVVIYRIYKR 221
>sp|P53266|SHY1_YEAST Cytochrome oxidase assembly protein SHY1 OS=Saccharomyces
cerevisiae (strain ATCC 204508 / S288c) GN=SHY1 PE=1
SV=1
Length = 389
Score = 64.7 bits (156), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 14/117 (11%)
Query: 74 LLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL--NITSPLTEDLKSLEFRR 131
L+F ISF LGTWQ+ R + K K++ + +L +P+ L + T + ED E+R+
Sbjct: 76 LMFAMPIISFYLGTWQVRRLKWKTKLIAACETKLTYEPIPLPKSFTPDMCED---WEYRK 132
Query: 132 VICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWV 188
VI G F ++VGPR ++ E GY++ TP + VL+ RGW+
Sbjct: 133 VILTGHFLHNEEMFVGPRKKN----GEKGYFLFTPFI-----RDDTGEKVLIERGWI 180
>sp|Q9ZCJ8|SURF1_RICPR SURF1-like protein OS=Rickettsia prowazekii (strain Madrid E)
GN=RP733 PE=3 SV=1
Length = 244
Score = 63.9 bits (154), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 64/264 (24%), Positives = 114/264 (43%), Gaps = 63/264 (23%)
Query: 73 WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSP---LTEDLKSLEF 129
+L+ I LG WQ+ R ++K +L +D ++ +I SP L + ++L +
Sbjct: 5 FLILTTFIILTSLGFWQLSRLKEK---------KLFLDSIQSHIISPGINLEKVQENLLY 55
Query: 130 RRVICQGVFDEQRSIYV-GPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWV 188
+V G F + IY+ G R + + ++GYY++TP I + +LV RGW
Sbjct: 56 HKVKITGQFLPNKDIYLYGIR---LMAMEKDGYYLVTPFKTIAD------QVILVVRGWF 106
Query: 189 PRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGVV 248
S R+K N+ +Q E++GV+
Sbjct: 107 --SNRNK------------NIIMKATNNQIH-----------------------EIIGVI 129
Query: 249 RGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKDV 308
SEK ++PAND + W +D+ + A L YI ++++ + LP +
Sbjct: 130 MPSEKTLSYLPANDIKNNVWLTLDLKEASKALKLNLENFYIIAEGKDISNLDIL-LPLSL 188
Query: 309 STLLRSSVMPQDHLNYTLTWYSLS 332
+ L +++ DHL Y +TW+ L+
Sbjct: 189 NHL---ALIKNDHLEYAITWFGLA 209
>sp|Q9K809|SPPA_BACHD Putative signal peptide peptidase SppA OS=Bacillus halodurans
(strain ATCC BAA-125 / DSM 18197 / FERM 7344 / JCM 9153
/ C-125) GN=sppA PE=3 SV=1
Length = 331
Score = 40.0 bits (92), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 40/165 (24%), Positives = 73/165 (44%), Gaps = 18/165 (10%)
Query: 31 APPRLYSSSAAAALSSAPQL-------SSSSQDQENVRKGSAPSSTWSKWLLFLPGAISF 83
A L+ +SAA +L S+P + + +S Q V G+ + + +L L G I
Sbjct: 12 AAAMLFVASAAISLVSSPAVDVDEWVGTGTSYKQTIVETGTDFGKSIA--ILELSGVIQD 69
Query: 84 -----GLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVF 138
L ++ +D +K LE + + L + +P L+S E + + + V
Sbjct: 70 TGSAPSLLNTGVYHHRDFLKQLEKAGEDPNIAGIILQVNTPGGGVLESAEIHKQVEEIVQ 129
Query: 139 DEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLV 183
D ++ +YV + + SG GYY+ P I +PQ++ + V
Sbjct: 130 DSEKPVYVSMGNMAASG----GYYISAPATKIYAHPQTITGSIGV 170
>sp|A0AVT1|UBA6_HUMAN Ubiquitin-like modifier-activating enzyme 6 OS=Homo sapiens GN=UBA6
PE=1 SV=1
Length = 1052
Score = 33.1 bits (74), Expect = 3.4, Method: Composition-based stats.
Identities = 29/124 (23%), Positives = 49/124 (39%), Gaps = 20/124 (16%)
Query: 171 PNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSS--------WW 222
P P + + +L + + R + +DSE+ L LA S+ ++ + W
Sbjct: 319 PEAPLEIHTAMLALDQFQEKYSRKPNVGCQQDSEELLKLATSISETLEEKPDVNADIVHW 378
Query: 223 WFWLKKPNI--VEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACAC 280
W + + + V +AS EV+ V G P CQW Y++ I +
Sbjct: 379 LSWTAQGFLSPLAAAVGGVASQEVLKAVTGKFSPL----------CQWLYLEAADIVESL 428
Query: 281 GLPE 284
G PE
Sbjct: 429 GKPE 432
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.316 0.130 0.396
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 130,951,986
Number of Sequences: 539616
Number of extensions: 5512442
Number of successful extensions: 15125
Number of sequences better than 100.0: 26
Number of HSP's better than 100.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 15038
Number of HSP's gapped (non-prelim): 49
length of query: 350
length of database: 191,569,459
effective HSP length: 118
effective length of query: 232
effective length of database: 127,894,771
effective search space: 29671586872
effective search space used: 29671586872
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)
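
Note on the statistics above: the effective lengths, bit scores and Expect values in this report appear to follow the usual Karlin-Altschul relations. The sketch below is an illustration only, not BLAST's own code. It assumes that the second Lambda/K row holds the gapped parameters, that the effective HSP length is subtracted once from the query and once per database sequence, and that E = (effective search space) * 2^(-bit score). All variable and function names are made up for this example. Under those assumptions it reproduces the effective query length (232), effective database length (127,894,771) and effective search space (29,671,586,872) listed above, and the first two hits' raw scores (1011 and 609) round to the reported 394 and 239 bits.

```python
import math

# Values copied from the statistics footer of this report.
query_len = 350            # "length of query"
db_letters = 191_569_459   # "length of database"
db_seqs = 539_616          # "Number of Sequences"
hsp_len = 118              # "effective HSP length"

# Assumption: the second Lambda/K row is the gapped parameter set.
lam, k = 0.267, 0.0410

# Edge correction: remove the expected HSP length from the query once
# and from every database sequence once.
eff_query = query_len - hsp_len
eff_db = db_letters - db_seqs * hsp_len
search_space = eff_query * eff_db


def bit_score(raw_score):
    """Convert a raw alignment score to a normalized bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(raw_score):
    """Expected number of chance hits scoring at least raw_score."""
    return search_space * 2.0 ** (-bit_score(raw_score))


print(eff_query, eff_db, search_space)
print(round(bit_score(1011)), round(bit_score(609)))
print(f"{expect(609):.0e}")
```

The reported Expect values are rounded and some hits use compositional matrix adjustment, so agreement in the last digit should not be expected for every hit.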