BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 018777
         (350 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9SE51|SURF1_ARATH Surfeit locus protein 1 OS=Arabidopsis thaliana GN=SURF1 PE=2 SV=1
          Length = 354

 Score =  394 bits (1011), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 188/298 (63%), Positives = 238/298 (79%), Gaps = 8/298 (2%)

Query: 55  QDQENVRKGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL 114
             QEN R      S WS+ LLFLPGAI+FGLG+WQI RR++K K LEY+Q RL M+P++L
Sbjct: 63  PPQENKR-----GSKWSQLLLFLPGAITFGLGSWQIVRREEKFKTLEYQQQRLNMEPIKL 117

Query: 115 NITSPLTEDLKSLEFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNP 174
           NI  PL ++L +LEFRRV C+GVFDEQRSIY+GPRSRSISG+TENG++VITPLMPIP + 
Sbjct: 118 NIDHPLDKNLNALEFRRVSCKGVFDEQRSIYLGPRSRSISGITENGFFVITPLMPIPGDL 177

Query: 175 QSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQ--QSQQSSWWWFWLKKPNIV 232
            S++SP+LVNRGWVPRSWR+KS E S ++E   N +   +   ++  SWW FW K P I 
Sbjct: 178 DSMQSPILVNRGWVPRSWREKSQE-SAEAEFIANQSTKAKSPSNEPKSWWKFWSKTPVIT 236

Query: 233 EDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDT 292
           ++ + ++  VEVVGV+RG E PSIFVP+NDPS+ QWFYVDVPA+A A GLPENT+Y+ED 
Sbjct: 237 KEHISAVKPVEVVGVIRGGENPSIFVPSNDPSTGQWFYVDVPAMARAVGLPENTIYVEDV 296

Query: 293 NENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRPKKSRR 350
           +E+V+ S PYP+PKD++TL+RS VMPQDHLNY++TWYSLSAAVTFMA+KRL+ K  RR
Sbjct: 297 HEHVDRSRPYPVPKDINTLIRSKVMPQDHLNYSITWYSLSAAVTFMAYKRLKAKPVRR 354


>sp|Q9LP74|SURFL_ARATH Surfeit locus protein 1-like OS=Arabidopsis thaliana GN=At1g48510
           PE=2 SV=2
          Length = 384

 Score =  239 bits (609), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 136/313 (43%), Positives = 189/313 (60%), Gaps = 26/313 (8%)

Query: 34  RLYSSSAAAALSSAPQLSSSSQ--DQENVRKGSAP----SSTWSKWLLFLPGAISFGLGT 87
           RL S S   + S+   L ++SQ  + E+    SAP        S  L +L G  ++GLG 
Sbjct: 11  RLISQSQYMSSSTTSNLPAASQTSNLESQLLSSAPPPAKKKRGSALLWYLVGFTTYGLGE 70

Query: 88  WQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVFDEQRSIYVG 147
              F  Q +++ L+ R+  L+M P++LN T    +DL  L FRRV+C+G+FDEQRSIYVG
Sbjct: 71  TYKFL-QTQVEHLDSRKQCLEMKPMKLNTT----KDLDGLGFRRVVCKGIFDEQRSIYVG 125

Query: 148 PRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPL 207
           P+ RS+S  +E G+YVITPL+PIPN P S+KSP+LVNRGWVP  W++ S E    S    
Sbjct: 126 PKPRSMSKSSEIGFYVITPLLPIPNEPNSMKSPILVNRGWVPSDWKENSLE----SLGTG 181

Query: 208 NLAPSVQQSQQSS-----------WWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSI 256
            L  + ++S++++            +W+ L  P IVED V     VEVVGVVR SE P I
Sbjct: 182 GLVAAAKESRKANKLLSSQQSLLSKFWYKLNNPMIVEDQVSRAMHVEVVGVVRKSETPGI 241

Query: 257 FVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKDVSTLLRSSV 316
           +   N PSS  WFY+DVP +A A G  E+T+YIE T  +++ S  YP+P+DV  L RS  
Sbjct: 242 YTLVNYPSSLAWFYLDVPKLALAMGFGEDTMYIESTYTDMDESRTYPVPRDVENLTRSKD 301

Query: 317 MPQDHLNYTLTWY 329
           +P D+  YT+ W+
Sbjct: 302 IPLDYHLYTVLWH 314


>sp|P09925|SURF1_MOUSE Surfeit locus protein 1 OS=Mus musculus GN=Surf1 PE=2 SV=3
          Length = 306

 Score =  115 bits (289), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 93/352 (26%), Positives = 161/352 (45%), Gaps = 72/352 (20%)

Query: 7   MAVASISKTLTK-----LGGGSSFLLNHRAPPRLYSSSAAAALSSAPQLSSSSQDQENVR 61
           MA+A + + +T+       G + F    R+   ++  S  + +   P+   SS  +    
Sbjct: 5   MALAVLPRRMTRWSQWAYAGRAQFCAVRRS---VFGFSVRSGMVCRPRRCCSSTAETA-- 59

Query: 62  KGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLT 121
              A   ++ +W L L  A +FGLGTWQ+ RR+ K+K++   ++R+  +P+ L    P+ 
Sbjct: 60  AAKAEDDSFLQWFLLLIPATAFGLGTWQVQRRKWKLKLIAELESRVMAEPIPLP-ADPM- 117

Query: 122 EDLKSLEFRRVICQGVFDEQRSIYVGPRS--------RSISGV--TENGYYVITPLMPIP 171
            +LK+LE+R V  +G FD  + +Y+ PR+        R    +  TE+G +V+TP     
Sbjct: 118 -ELKNLEYRPVKVRGHFDHSKELYIMPRTMVDPVREARDAGRLSSTESGAHVVTPF---- 172

Query: 172 NNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNI 231
            +   +   +LVNRG+VPR                  + P  +Q  Q             
Sbjct: 173 -HCSDLGVTILVNRGFVPRK----------------KVNPETRQKGQ------------- 202

Query: 232 VEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIED 291
                  +  V++VG+VR +E    FVP N P    W+Y D+ A+A   G   + ++I+ 
Sbjct: 203 ------VLGEVDLVGIVRLTENRKPFVPENSPERNHWYYRDLEAMAKITG--ADPIFIDA 254

Query: 292 TNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
              +  P    P+       LR+     +H+ Y LTWY L AA +++ F++ 
Sbjct: 255 DFHSTAPGG--PIGGQTRVTLRN-----EHMQYILTWYGLCAATSYLWFQKF 299


>sp|Q15526|SURF1_HUMAN Surfeit locus protein 1 OS=Homo sapiens GN=SURF1 PE=1 SV=1
          Length = 300

 Score =  115 bits (287), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 87/307 (28%), Positives = 145/307 (47%), Gaps = 65/307 (21%)

Query: 48  PQLSSSSQDQENVRKGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRL 107
           P    SS  + +  K  A   ++ +W+L L    +FGLGTWQ+ RR+ K+ ++   ++R+
Sbjct: 41  PSRCGSSAAEASATK--AEDDSFLQWVLLLIPVTAFGLGTWQVQRRKWKLNLIAELESRV 98

Query: 108 QMDPLRLNITSPLTEDLKSLEFRRVICQGVFDEQRSIYVGPRSRS-----------ISGV 156
             +P+ L    P+  +LK+LE+R V  +G FD  + +Y+ PR+             IS  
Sbjct: 99  LAEPVPLP-ADPM--ELKNLEYRPVKVRGCFDHSKELYMMPRTMVDPVREAREGGLISSS 155

Query: 157 TENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQS 216
           T++G YV+TP      +   +   +LVNRG+VPR                  + P  +Q 
Sbjct: 156 TQSGAYVVTPF-----HCTDLGVTILVNRGFVPRK----------------KVNPETRQK 194

Query: 217 QQSSWWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAI 276
            Q             +E +      V+++G+VR +E    FVP N+P    W Y D+ A+
Sbjct: 195 GQ-------------IEGE------VDLIGMVRLTETRQPFVPENNPERNHWHYRDLEAM 235

Query: 277 ACACGLPENTVYIEDTNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVT 336
           A   G     ++I+   ++  P    P+       LR+     +HL Y +TWY LSAA +
Sbjct: 236 ARITG--AEPIFIDANFQSTVPGG--PIGGQTRVTLRN-----EHLQYIVTWYGLSAATS 286

Query: 337 FMAFKRL 343
           ++ FK+ 
Sbjct: 287 YLWFKKF 293


>sp|Q9QXU2|SURF1_RAT Surfeit locus protein 1 OS=Rattus norvegicus GN=Surf1 PE=2 SV=1
          Length = 306

 Score =  114 bits (285), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 82/281 (29%), Positives = 137/281 (48%), Gaps = 63/281 (22%)

Query: 73  WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRV 132
           +LLF+P A +FGLGTWQ+ RR+ K+K++   ++R+  +P+ L    P+  +LK+LE+R V
Sbjct: 72  FLLFIP-ATAFGLGTWQVQRRKWKLKLIAELESRVMAEPIPLP-ADPM--ELKNLEYRPV 127

Query: 133 ICQGVFDEQRSIYVGPRS--------RSISGV--TENGYYVITPLMPIPNNPQSVKSPVL 182
             +G FD  + +Y+ PR+        R    +  TE+G YV+TP      +   +   +L
Sbjct: 128 KVRGHFDHSKELYIMPRTMVDPVREARDAGRLSSTESGAYVVTPF-----HCSDLGVTIL 182

Query: 183 VNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASV 242
           VNRG+VPR                  + P  +Q  Q                    +  V
Sbjct: 183 VNRGFVPRK----------------KVNPETRQQGQ-------------------VLGEV 207

Query: 243 EVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPY 302
           ++VG+VR +E    FVP N+P    W+Y D+ A+A   G   + ++I+    +  P    
Sbjct: 208 DLVGIVRLTENRKPFVPENNPERSLWYYRDLDAMAKRTG--TDPIFIDADFNSTTPGG-- 263

Query: 303 PLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
           P+       LR+     +H+ Y +TWY L AA +++ F++ 
Sbjct: 264 PIGGQTRVTLRN-----EHMQYIITWYGLCAATSYLWFRKF 299


>sp|A4IHH4|SURF1_XENTR Surfeit locus protein 1 OS=Xenopus tropicalis GN=surf1 PE=2 SV=1
          Length = 307

 Score =  109 bits (273), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 96/353 (27%), Positives = 150/353 (42%), Gaps = 68/353 (19%)

Query: 7   MAVASISKTLTKLGGGSSFL-----LNHRAPPRLYSSSAAAALSSAPQLSSSSQDQENVR 61
           MA+  ++K L   G  +  L     L+H A P   + S  A L    +  ++        
Sbjct: 1   MALPGVTKLLLLPGVRAQLLNTPVRLSHWATPGRCTKSCHAYLQKNLRFCTTRSFSSVSP 60

Query: 62  KGSAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLT 121
              +   T  KWLL L    +F LGTWQ+ RR  K+K+++  + R+   P+ L  T P+ 
Sbjct: 61  AAESSEDTVLKWLLLLIPVATFSLGTWQVQRRSWKLKLIQEMEARVSGKPIPLT-TDPM- 118

Query: 122 EDLKSLEFRRVICQGVFDEQRSIYVGPRS-----------RSISGVTENGYYVITPLMPI 170
            ++K LE+R V  +G FD  + +Y+ PR+             ++  T++G  VITP    
Sbjct: 119 -EIKELEYRPVKVRGHFDHSKELYILPRTLVDPEREAREAGQLASNTQSGAQVITPFY-- 175

Query: 171 PNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPN 230
                 +   +LVNRG+VP+                  + P  +   Q S          
Sbjct: 176 ---CSDLGITILVNRGFVPKK----------------KVNPETRPKGQVS---------- 206

Query: 231 IVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIE 290
                      VE+VG+VR +E    FVP NDPS   W Y D+ A+A   G     + I+
Sbjct: 207 ---------GEVELVGIVRLNETRKPFVPHNDPSRNLWHYKDLSAMAQVVG--AEPILID 255

Query: 291 DTNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
               +  P    P+       LR+     +H+ Y +TWY L AA T++  K+ 
Sbjct: 256 ADRGSTVPGG--PIGGQTRVTLRN-----EHMQYIVTWYGLCAATTYLWCKKF 301


>sp|Q800L1|SURF1_CHICK Surfeit locus protein 1 OS=Gallus gallus GN=SURF1 PE=3 SV=1
          Length = 309

 Score =  108 bits (271), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 85/291 (29%), Positives = 131/291 (45%), Gaps = 63/291 (21%)

Query: 64  SAPSSTWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTED 123
           +A    W KW L L    +F LGTWQI RR+ K+ ++    +RL  +P+ L +  P+  +
Sbjct: 65  AAGEDAWLKWGLLLVPLTAFCLGTWQIQRRKWKLDLIAQLASRLSSEPIPLTL-DPM--E 121

Query: 124 LKSLEFRRVICQGVFDEQRSIYVGPRS--------RSISGVT---ENGYYVITPLMPIPN 172
           LK LE+R V  +G FD  + +Y+ PRS        R    +T   ENG  VITP      
Sbjct: 122 LKELEYRPVKVRGHFDHSKELYILPRSLVDPEREAREAGKLTSHAENGANVITPFYCT-- 179

Query: 173 NPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIV 232
               +   +LVNRG+VP+                  L P  +   Q             +
Sbjct: 180 ---ELGVTILVNRGFVPKK----------------KLKPETRLKGQ-------------I 207

Query: 233 EDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDT 292
           E++      +++ GVVR SEK   FVP N+    +W Y D+ A+A   G     ++I+  
Sbjct: 208 EEE------IDLTGVVRLSEKRKPFVPENNIEKNRWHYRDLEAMAKVTG--AEPIFIDAD 259

Query: 293 NENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
             +  P  P      VS       +  +H+ Y +TWY L AA +F+ +++ 
Sbjct: 260 FRSTVPGGPIGGQTRVS-------LRNEHMQYIVTWYGLCAATSFLWYRKF 303


>sp|O57593|SURF1_TAKRU Surfeit locus protein 1 (Fragment) OS=Takifugu rubripes GN=surf1
           PE=3 SV=1
          Length = 240

 Score =  105 bits (263), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 81/286 (28%), Positives = 122/286 (42%), Gaps = 63/286 (22%)

Query: 72  KWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRR 131
           KW L L  A +FGLGTWQ+ RRQ K+++++        +P+ L I      +L SLE+RR
Sbjct: 4   KWFLLLIPATTFGLGTWQVKRRQWKMELIDGLTKLTTAEPIPLPIDPA---ELSSLEYRR 60

Query: 132 VICQGVFDEQRSIYVGPRS-----------RSISGVTENGYYVITPLMPIPNNPQSVKSP 180
           V  +G +D  + +Y+ PRS             +S   E G  VITP      +   +   
Sbjct: 61  VKMRGKYDHSKELYILPRSPVDPEKEAREAGRLSSSGETGANVITPF-----HVTDLGIT 115

Query: 181 VLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIA 240
           +LVNRG+VP+                  + P  +   Q                      
Sbjct: 116 ILVNRGYVPKK----------------KIRPETRMKGQVE-------------------G 140

Query: 241 SVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSN 300
            +EVVGVVR +E    FVP ND     W Y D+ A+    G     ++++    +  P  
Sbjct: 141 EMEVVGVVRLTETRKPFVPNNDVERNHWHYRDLEAMCQVTG--AEPIFVDADFSSTVPGG 198

Query: 301 PYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRPK 346
             P+       LR+     +H+ Y +TWY L AA ++M F +   K
Sbjct: 199 --PIGGQTRVTLRN-----EHMQYIVTWYGLCAATSYMWFAKFIKK 237


>sp|A9UWF0|SURF1_MONBE SURF1-like protein OS=Monosiga brevicollis GN=18583 PE=3 SV=1
          Length = 261

 Score =  102 bits (254), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 84/285 (29%), Positives = 128/285 (44%), Gaps = 60/285 (21%)

Query: 73  WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRV 132
           W L     ++FGLGTWQIFR+Q K +++   + +L  +P  L  T+P   DL  +E+ RV
Sbjct: 19  WALLSVPVVTFGLGTWQIFRKQQKEELIATLEAKLSKEPAALP-TNP--ADLAHMEYERV 75

Query: 133 ICQGVFDEQRSIYVGPRS------RSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRG 186
              G F   + + VGPR+        ++ + E G  VITP              +LVNRG
Sbjct: 76  AVTGTFLHDQEMLVGPRTVTREVFSGMADLPEAGVQVITPF-----RLADTGEVILVNRG 130

Query: 187 WVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVG 246
           +VP            +++ P    P  + + Q                      +V + G
Sbjct: 131 FVP------------EAQAP----PHKRAAGQVE-------------------GTVRLEG 155

Query: 247 VVRGSEKPSIFVPANDPSSCQWFYVDVPAIAC-ACGLPENTVYIEDTNENVNPSNPYPLP 305
           +VR  E  + FVP N P    W+++DV  +A     LP   V I+ T E   P   +PL 
Sbjct: 156 IVRHGESQTAFVPDNHPEQNTWYWIDVFTMASNRSALP---VLIDATAE-CTPPGGFPLG 211

Query: 306 KDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLRPKKSRR 350
              +  +R+     +HL+Y +TWYS+S A+T   +  LR K   R
Sbjct: 212 GQTNITVRN-----EHLSYIITWYSIS-AITLAMWVFLRRKGGNR 250


>sp|Q556J9|SURF1_DICDI SURF1-like protein OS=Dictyostelium discoideum GN=surf1-1 PE=3 SV=2
          Length = 270

 Score =  102 bits (253), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 88/284 (30%), Positives = 139/284 (48%), Gaps = 40/284 (14%)

Query: 74  LLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL------NITSPLTEDLKSL 127
           L F+   I+FGLGTWQ++R   K ++++  ++R++ DP+ L      N       DL   
Sbjct: 10  LFFIFPVIAFGLGTWQVYRYDWKKRLIQRAKDRMEEDPIELSNSFIKNFKGSSFGDLNKY 69

Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
           EFRRV   G   + + + +GP  RSI G    GYYVI+PL        S  + +L+NRGW
Sbjct: 70  EFRRVYLNGKVIDNQYVLLGP--RSIDGTL--GYYVISPLQ------LSDGTRILLNRGW 119

Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIA--SVEVV 245
              +   KS+     + + L L    ++ Q               + +  SI      ++
Sbjct: 120 SAST--PKSNYKIPYAIEELKLIHQKEKEQGQQ------------QGNQESILYRYFNIL 165

Query: 246 GVV-RGSEKPSIFVPANDPSSCQWFYVDVPAIACACG---LPENTVYIEDTNENVNPSN- 300
           GV+ +  E+ S F P N P   QW+ +DV A+A       L  NT  +++T  N  PS+ 
Sbjct: 166 GVISKTKERGSAFTPTNQPEKGQWYSLDVDAMADQLNTEPLMINT--MDETEINSKPSSL 223

Query: 301 PYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLR 344
           P P  K     + SS   + H++Y  TWY+LSA++ F+ F+ +R
Sbjct: 224 PNPQFKRFDNDVESSFHNK-HMSYIGTWYTLSASLFFIYFRYMR 266


>sp|Q9Y810|SHY1_SCHPO Cytochrome oxidase assembly protein shy1 OS=Schizosaccharomyces
           pombe (strain 972 / ATCC 24843) GN=shy1 PE=3 SV=1
          Length = 290

 Score = 80.9 bits (198), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 76/272 (27%), Positives = 123/272 (45%), Gaps = 62/272 (22%)

Query: 81  ISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVFDE 140
           ++F LGTWQ+ RR+ K+ ++     RLQ   + L  T    +D K LE+ RV+ +GVF  
Sbjct: 49  VTFALGTWQVKRREWKMGIINTLTERLQQPAILLPKTVT-EQDTKKLEWTRVLLRGVFCH 107

Query: 141 QRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWVPRSWRDKSSEVS 200
            + + VGPR++      + GY+V+TP   I ++ + +    LVNRGW+ RS+ ++SS   
Sbjct: 108 DQEMLVGPRTKE----GQPGYHVVTPF--ILDDGRRI----LVNRGWIARSFAEQSS--- 154

Query: 201 RDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGVVRGSEKPSIFVPA 260
           RD   P +L                 K P ++E            G++R       F+  
Sbjct: 155 RD---PSSLP----------------KGPVVIE------------GLLRQHTDKPRFMMK 183

Query: 261 NDPSSCQWFYVDVPAIACACG-LPENTVYIE------DTNENVNPSNPYPLPKDVSTLLR 313
           N+P    +++++V   A   G LP     ++         ++V    P   P  V     
Sbjct: 184 NEPEKNSFYFLNVREFAQLKGTLPILITELQPSLTPLQEADHVKRGLPLGHPLKVEIF-- 241

Query: 314 SSVMPQDHLNYTLTWYSL---SAAVTFMAFKR 342
                  H  Y +TWYSL   SA + ++ FKR
Sbjct: 242 -----NSHTEYIITWYSLSVVSAIMLYVYFKR 268


>sp|Q92GL0|SURF1_RICCN SURF1-like protein OS=Rickettsia conorii (strain ATCC VR-613 /
           Malish 7) GN=RC1113 PE=3 SV=1
          Length = 240

 Score = 80.5 bits (197), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 68/277 (24%), Positives = 120/277 (43%), Gaps = 63/277 (22%)

Query: 71  SKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSP---LTEDLKSL 127
           + +L+F+   I   LG WQ+ R ++K         +L +  ++ N+TSP   L E    L
Sbjct: 3   TNFLVFITFTILISLGFWQLSRLKEK---------KLFLASMQANLTSPAINLAEIQDGL 53

Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
            + +V   G F   + IY+  R RS+S   ++GYY++TP   I +        +LV RGW
Sbjct: 54  PYHKVKITGQFLPNKDIYLYGR-RSMSS-EKDGYYLVTPFKTIED------KVILVARGW 105

Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGV 247
                ++  ++ + D +                                      E++GV
Sbjct: 106 FSNRNKNIITQATNDRQH-------------------------------------EIIGV 128

Query: 248 VRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKD 307
              SEK  I++PAND  +  W  +++   +   GL     YI    ++++  +   LP  
Sbjct: 129 TMPSEKTRIYLPANDIKNNVWLTLNLKETSKVLGLDLENFYIIAEGKDISNLDIL-LPLA 187

Query: 308 VSTLLRSSVMPQDHLNYTLTWYSL--SAAVTFMAFKR 342
           ++ L   + +  DHL Y LTW+ L  S  V ++ ++R
Sbjct: 188 INHL---AAIRNDHLEYALTWFGLAISLIVIYVIYRR 221


>sp|A8Y2C9|SURF1_CAEBR SURF1-like protein OS=Caenorhabditis briggsae GN=sft-1 PE=3 SV=1
          Length = 317

 Score = 80.1 bits (196), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 69/294 (23%), Positives = 123/294 (41%), Gaps = 70/294 (23%)

Query: 68  STWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL--NITSPLTEDLK 125
           ST S  +L LP A +F LG WQI+R   K++++E+ ++RL  + + L  +++S     L+
Sbjct: 76  STGSILMLGLP-AFAFSLGVWQIYRLIWKLELIEHLKSRLSQEAIELPDDLSS---SSLE 131

Query: 126 SLEFRRVICQGVFDEQRSIYVGPRSR---------------SISGVTENGYYVITPLMPI 170
            LE+ RV   G F  Q+   + PR R               S + ++ +G ++ITP    
Sbjct: 132 PLEYCRVRVTGEFLHQKEFVISPRGRFDPAKKTSASVGSMLSENEMSSHGGHLITPF--- 188

Query: 171 PNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPN 230
               ++    +L+NRGW+P  + D  S    + +                          
Sbjct: 189 --RLKNTGKVILINRGWLPTFYFDPESHAKTNPQ-------------------------- 220

Query: 231 IVEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIE 290
                     +V +  +VR +E+   FV  N P    W+Y D+  +A   G     V+++
Sbjct: 221 ---------GTVILEAIVRKTEQRPQFVGQNVPEQGVWYYRDLEQMAKWHG--TEPVWLD 269

Query: 291 DTNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLR 344
              E   P  P     +++       +  +H+NY  TW++L+     M   + R
Sbjct: 270 AAYETTVPGGPIGGQTNIN-------VRNEHMNYLTTWFTLTLVTMLMWIHKFR 316


>sp|Q9U4F3|SURF1_DROME SURF1-like protein OS=Drosophila melanogaster GN=Surf1 PE=2 SV=1
          Length = 300

 Score = 79.3 bits (194), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 77/283 (27%), Positives = 120/283 (42%), Gaps = 65/283 (22%)

Query: 73  WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRV 132
           W L L  A +FGLG WQ+ R+  K ++++    +L   P+ L     LT DL  +E+R V
Sbjct: 65  WFLLLIPATTFGLGCWQVKRKIWKEQLIKDLNKQLSTAPVAL--PDDLT-DLAQMEYRLV 121

Query: 133 ICQGVFDEQRSIYVGPRS-------RSISGV-----TENGYYVITPLMPIPNNPQSVKSP 180
             +G F   + + +GPRS        +  G+     + NGY ++TP      +       
Sbjct: 122 KIRGRFLHDKEMRLGPRSLIRPDGVETQGGLFSQRDSGNGYLIVTPFQLADRD-----DI 176

Query: 181 VLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIA 240
           VLVNRGW           VSR   +P       QQ                        A
Sbjct: 177 VLVNRGW-----------VSRKQVEPETRPLGQQQ------------------------A 201

Query: 241 SVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSN 300
            VE+  VVR  E    F P  D     + Y D+  +  A G     V+++   +    ++
Sbjct: 202 EVELTAVVRKGEARPQFTP--DHKGNVYLYRDLARMCAATG--AAPVFLDAVYDPQTAAH 257

Query: 301 PYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRL 343
             P+       LR+     DHL+Y +TW+SLSAA +F+ ++++
Sbjct: 258 A-PIGGQTRVTLRN-----DHLSYLVTWFSLSAATSFLWYRQI 294


>sp|Q9N5N8|SURF1_CAEEL SURF1-like protein OS=Caenorhabditis elegans GN=sft-1 PE=3 SV=1
          Length = 323

 Score = 72.4 bits (176), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 69/293 (23%), Positives = 115/293 (39%), Gaps = 70/293 (23%)

Query: 69  TWSKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL--NITSPLTEDLKS 126
           T S  +L +P   +F LG WQ FR + K+ ++E+ + RL      L  +++    E L+ 
Sbjct: 83  TGSVLMLTIP-VFAFSLGIWQTFRLKWKLDLIEHLKGRLNQTAQELPEDLSC---ESLEP 138

Query: 127 LEFRRVICQGVFDEQRSIYVGPRSRSISG---------------VTENGYYVITPLMPIP 171
           LE+ RV   G F  ++   + PR R   G               ++ +G ++ITP     
Sbjct: 139 LEYCRVTVTGEFLHEKEFIISPRGRFDPGKKTSAAAGSMLSENEMSSHGGHLITPF---- 194

Query: 172 NNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNI 231
              ++    +L+NRGW+P  + D  +    +    L L                      
Sbjct: 195 -RLKNSGKIILINRGWLPSFYFDPETRQKTNPRGTLTL---------------------- 231

Query: 232 VEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIED 291
                P+I        VR +EK   FV  N P    W+Y D+  +A   G     V ++ 
Sbjct: 232 -----PAI--------VRKTEKRPQFVGQNVPEQGVWYYRDLNQMAKHYG--TEPVLLDA 276

Query: 292 TNENVNPSNPYPLPKDVSTLLRSSVMPQDHLNYTLTWYSLSAAVTFMAFKRLR 344
             E   P  P     +++       +  +HLNY  TW++L+     M   + R
Sbjct: 277 AYETTVPGGPIGGQTNIN-------VRNEHLNYLTTWFTLTLVTMLMWIHKFR 322


>sp|Q4UN32|SURF1_RICFE SURF1-like protein OS=Rickettsia felis (strain ATCC VR-1525 /
           URRWXCal2) GN=RF_0175 PE=3 SV=1
          Length = 226

 Score = 71.2 bits (173), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 65/277 (23%), Positives = 114/277 (41%), Gaps = 63/277 (22%)

Query: 71  SKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSP---LTEDLKSL 127
           +  ++ +   I   LG WQ+ R ++K         +L +  ++ N+TSP   L E   SL
Sbjct: 3   TNLVVLITFTILISLGFWQLSRLKEK---------KLFLASMQANLTSPAINLAEIQDSL 53

Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
            + +V   G F   + IY+  R    SG  ++GYY++TP   I +        +LV RGW
Sbjct: 54  PYHKVKITGQFLPNKDIYLYGRRSMSSG--KDGYYLVTPFKTIED------KVILVARGW 105

Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGV 247
                +   ++ + D +                                      E++GV
Sbjct: 106 FSNRNKIIITQATNDRQH-------------------------------------EIIGV 128

Query: 248 VRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKD 307
              SEK   ++PAND  +  W  +D+   +    L     YI    ++++  +   LP  
Sbjct: 129 TMPSEKTRSYLPANDIKNNVWLTLDLKEASQTLELNLEDFYIIAEGKDISNLDIL-LPLS 187

Query: 308 VSTLLRSSVMPQDHLNYTLTWYSL--SAAVTFMAFKR 342
           ++ L   + +  DHL Y LTW+ L  S  V ++ ++R
Sbjct: 188 INHL---AAIRNDHLEYALTWFGLAISLIVIYVIYRR 221


>sp|Q1RJM4|SURF1_RICBR SURF1-like protein OS=Rickettsia bellii (strain RML369-C)
           GN=RBE_0359 PE=3 SV=1
          Length = 241

 Score = 70.1 bits (170), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 72/278 (25%), Positives = 118/278 (42%), Gaps = 65/278 (23%)

Query: 71  SKWLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLK---SL 127
           +K  + +   I   LG WQ+ R ++K         +L +  ++ N+TSP  +  K   +L
Sbjct: 3   TKLTVLITFIILVLLGFWQLNRLKEK---------KLFLASMQENLTSPAIDLAKIQDNL 53

Query: 128 EFRRVICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGW 187
            + +V   G F   + IY+  R RS+S   ++GYY++TP              +LV RGW
Sbjct: 54  PYHKVKITGHFLPDKDIYLYGR-RSMSS-EKDGYYLVTPF------KTDEDKIILVARGW 105

Query: 188 VPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGV 247
              S R+K+      ++QP                                    E++GV
Sbjct: 106 F--SNRNKNIITQATNDQP-----------------------------------HELIGV 128

Query: 248 VRGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLP-ENTVYIEDTNENVNPSNPYPLPK 306
              SEK   ++PAND  +  W  +D+   +   GL  EN   IE++ +  N     PL  
Sbjct: 129 TMPSEKTRSYLPANDIKNNVWLTLDLQEASKVLGLNLENFYLIEESKDISNLDILLPLSI 188

Query: 307 DVSTLLRSSVMPQDHLNYTLTWYSLSAA--VTFMAFKR 342
           +    +R+     DHL Y  TW+ L+A+  V +  +KR
Sbjct: 189 NHLAAIRN-----DHLEYAFTWFGLAASLVVIYRIYKR 221


>sp|P53266|SHY1_YEAST Cytochrome oxidase assembly protein SHY1 OS=Saccharomyces
           cerevisiae (strain ATCC 204508 / S288c) GN=SHY1 PE=1
           SV=1
          Length = 389

 Score = 64.7 bits (156), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 14/117 (11%)

Query: 74  LLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRL--NITSPLTEDLKSLEFRR 131
           L+F    ISF LGTWQ+ R + K K++   + +L  +P+ L  + T  + ED    E+R+
Sbjct: 76  LMFAMPIISFYLGTWQVRRLKWKTKLIAACETKLTYEPIPLPKSFTPDMCED---WEYRK 132

Query: 132 VICQGVFDEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWV 188
           VI  G F     ++VGPR ++     E GY++ TP +            VL+ RGW+
Sbjct: 133 VILTGHFLHNEEMFVGPRKKN----GEKGYFLFTPFI-----RDDTGEKVLIERGWI 180


>sp|Q9ZCJ8|SURF1_RICPR SURF1-like protein OS=Rickettsia prowazekii (strain Madrid E)
           GN=RP733 PE=3 SV=1
          Length = 244

 Score = 63.9 bits (154), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 64/264 (24%), Positives = 114/264 (43%), Gaps = 63/264 (23%)

Query: 73  WLLFLPGAISFGLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSP---LTEDLKSLEF 129
           +L+     I   LG WQ+ R ++K         +L +D ++ +I SP   L +  ++L +
Sbjct: 5   FLILTTFIILTSLGFWQLSRLKEK---------KLFLDSIQSHIISPGINLEKVQENLLY 55

Query: 130 RRVICQGVFDEQRSIYV-GPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLVNRGWV 188
            +V   G F   + IY+ G R   +  + ++GYY++TP   I +        +LV RGW 
Sbjct: 56  HKVKITGQFLPNKDIYLYGIR---LMAMEKDGYYLVTPFKTIAD------QVILVVRGWF 106

Query: 189 PRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSSWWWFWLKKPNIVEDDVPSIASVEVVGVV 248
             S R+K            N+      +Q                         E++GV+
Sbjct: 107 --SNRNK------------NIIMKATNNQIH-----------------------EIIGVI 129

Query: 249 RGSEKPSIFVPANDPSSCQWFYVDVPAIACACGLPENTVYIEDTNENVNPSNPYPLPKDV 308
             SEK   ++PAND  +  W  +D+   + A  L     YI    ++++  +   LP  +
Sbjct: 130 MPSEKTLSYLPANDIKNNVWLTLDLKEASKALKLNLENFYIIAEGKDISNLDIL-LPLSL 188

Query: 309 STLLRSSVMPQDHLNYTLTWYSLS 332
           + L   +++  DHL Y +TW+ L+
Sbjct: 189 NHL---ALIKNDHLEYAITWFGLA 209


>sp|Q9K809|SPPA_BACHD Putative signal peptide peptidase SppA OS=Bacillus halodurans
           (strain ATCC BAA-125 / DSM 18197 / FERM 7344 / JCM 9153
           / C-125) GN=sppA PE=3 SV=1
          Length = 331

 Score = 40.0 bits (92), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 40/165 (24%), Positives = 73/165 (44%), Gaps = 18/165 (10%)

Query: 31  APPRLYSSSAAAALSSAPQL-------SSSSQDQENVRKGSAPSSTWSKWLLFLPGAISF 83
           A   L+ +SAA +L S+P +       + +S  Q  V  G+    + +  +L L G I  
Sbjct: 12  AAAMLFVASAAISLVSSPAVDVDEWVGTGTSYKQTIVETGTDFGKSIA--ILELSGVIQD 69

Query: 84  -----GLGTWQIFRRQDKIKMLEYRQNRLQMDPLRLNITSPLTEDLKSLEFRRVICQGVF 138
                 L    ++  +D +K LE       +  + L + +P    L+S E  + + + V 
Sbjct: 70  TGSAPSLLNTGVYHHRDFLKQLEKAGEDPNIAGIILQVNTPGGGVLESAEIHKQVEEIVQ 129

Query: 139 DEQRSIYVGPRSRSISGVTENGYYVITPLMPIPNNPQSVKSPVLV 183
           D ++ +YV   + + SG    GYY+  P   I  +PQ++   + V
Sbjct: 130 DSEKPVYVSMGNMAASG----GYYISAPATKIYAHPQTITGSIGV 170


>sp|A0AVT1|UBA6_HUMAN Ubiquitin-like modifier-activating enzyme 6 OS=Homo sapiens GN=UBA6
           PE=1 SV=1
          Length = 1052

 Score = 33.1 bits (74), Expect = 3.4,   Method: Composition-based stats.
 Identities = 29/124 (23%), Positives = 49/124 (39%), Gaps = 20/124 (16%)

Query: 171 PNNPQSVKSPVLVNRGWVPRSWRDKSSEVSRDSEQPLNLAPSVQQSQQSS--------WW 222
           P  P  + + +L    +  +  R  +    +DSE+ L LA S+ ++ +           W
Sbjct: 319 PEAPLEIHTAMLALDQFQEKYSRKPNVGCQQDSEELLKLATSISETLEEKPDVNADIVHW 378

Query: 223 WFWLKKPNI--VEDDVPSIASVEVVGVVRGSEKPSIFVPANDPSSCQWFYVDVPAIACAC 280
             W  +  +  +   V  +AS EV+  V G   P           CQW Y++   I  + 
Sbjct: 379 LSWTAQGFLSPLAAAVGGVASQEVLKAVTGKFSPL----------CQWLYLEAADIVESL 428

Query: 281 GLPE 284
           G PE
Sbjct: 429 GKPE 432


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.316    0.130    0.396 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 130,951,986
Number of Sequences: 539616
Number of extensions: 5512442
Number of successful extensions: 15125
Number of sequences better than 100.0: 26
Number of HSP's better than 100.0 without gapping: 17
Number of HSP's successfully gapped in prelim test: 9
Number of HSP's that attempted gapping in prelim test: 15038
Number of HSP's gapped (non-prelim): 49
length of query: 350
length of database: 191,569,459
effective HSP length: 118
effective length of query: 232
effective length of database: 127,894,771
effective search space: 29671586872
effective search space used: 29671586872
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)