BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 025227
         (256 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q9C9G4|ENDO2_ARATH Endonuclease 2 OS=Arabidopsis thaliana GN=ENDO2 PE=2 SV=1
          Length = 290

 Score =  407 bits (1046), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 190/252 (75%), Positives = 219/252 (86%), Gaps = 2/252 (0%)

Query: 5   QSRLREAAADAVKQLLPESADNDLGSVCTWADHVKFHYHWSSALHFIDTPDNLCTYQYNR 64
           Q+RL E AA AVK+LLPESA+ DL S+C WAD VKF YHWSS LH+I+TPD  C+YQYNR
Sbjct: 41  QTRLDETAAKAVKELLPESAEGDLSSLCLWADRVKFRYHWSSPLHYINTPD-ACSYQYNR 99

Query: 65  DCKDEDGVKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLHVG 124
           DCKDE G KGRCVAGAI NYTTQLLSY +A+SS S+YNLTEALLF+SHF+GDIHQPLHV 
Sbjct: 100 DCKDESGEKGRCVAGAIYNYTTQLLSYKTAASSQSQYNLTEALLFVSHFMGDIHQPLHVS 159

Query: 125 FTSDRGGNTIDVHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDAIQQNITTDWADL 184
           + SD+GGNTI+VHWYTRK  LHH+WD+NIIETAE   YNS ++G+VDA+++NITT+WAD 
Sbjct: 160 YASDKGGNTIEVHWYTRKANLHHIWDSNIIETAEADLYNSALEGMVDALKKNITTEWADQ 219

Query: 185 VKKWETCSANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLAQGG 244
           VK+WETC+   TACPD+YASEGI+AACDWAYKGV+EG  LEDEYF SRLPIV  RLAQGG
Sbjct: 220 VKRWETCT-KKTACPDIYASEGIQAACDWAYKGVTEGDTLEDEYFYSRLPIVYQRLAQGG 278

Query: 245 VRLAATLNRIFG 256
           VRLAATLNRIFG
Sbjct: 279 VRLAATLNRIFG 290


>sp|F4JJL0|ENDO4_ARATH Endonuclease 4 OS=Arabidopsis thaliana GN=ENDO4 PE=3 SV=1
          Length = 299

 Score =  312 bits (800), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 147/253 (58%), Positives = 183/253 (72%), Gaps = 2/253 (0%)

Query: 5   QSRLREAAADAVKQLLPESADNDLGSVCTWADHVKFHYHW--SSALHFIDTPDNLCTYQY 62
           +S   E    AVK+LLP+SAD DL SVC+W D +K H+ W  +S LH++DTPD  C Y+Y
Sbjct: 38  ESYFEEETVAAVKKLLPKSADGDLASVCSWPDEIKHHWQWRWTSPLHYVDTPDYRCNYEY 97

Query: 63  NRDCKDEDGVKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLH 122
            RDC D    + RCV GAI NYT QL+S +  S +   YNLTEAL+FLSHFIGDIHQPLH
Sbjct: 98  CRDCHDTHKNQDRCVTGAIFNYTMQLMSASENSDTIVHYNLTEALMFLSHFIGDIHQPLH 157

Query: 123 VGFTSDRGGNTIDVHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDAIQQNITTDWA 182
           VGF  D GGNTI V WY RK  LHHVWDN IIE+A + +YN ++  +++A+Q N+T DW+
Sbjct: 158 VGFLGDEGGNTITVRWYRRKTNLHHVWDNMIIESALKTYYNKSLPLMIEALQANLTNDWS 217

Query: 183 DLVKKWETCSANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLAQ 242
           + V  WE+C  N TACP+ YASE I  AC +AY+  + G+ L D+YF SRLPIV+ RLAQ
Sbjct: 218 NDVPLWESCQLNQTACPNPYASESINLACKYAYRNATPGTTLGDDYFLSRLPIVEKRLAQ 277

Query: 243 GGVRLAATLNRIF 255
           GG+RLAATLNRIF
Sbjct: 278 GGIRLAATLNRIF 290


>sp|Q9SXA6|ENDO1_ARATH Endonuclease 1 OS=Arabidopsis thaliana GN=ENDO1 PE=1 SV=1
          Length = 305

 Score =  283 bits (725), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 136/254 (53%), Positives = 172/254 (67%), Gaps = 5/254 (1%)

Query: 5   QSRLREAAADAVKQLLPESADNDLGSVCTWADHVK--FHYHWSSALHFIDTPDNLCTYQY 62
           Q+ L    A  V+ LLP+    DL ++C W D ++  + Y W+S LH+IDTPD  C+Y+Y
Sbjct: 42  QNLLEAGPAHVVENLLPDYVKGDLSALCVWPDQIRHWYKYRWTSHLHYIDTPDQACSYEY 101

Query: 63  NRDCKDEDGVKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLH 122
           +RDC D+ G+K  CV GAI N+T+QL  Y   +S    YN+TEALLFLSHF+GDIHQP+H
Sbjct: 102 SRDCHDQHGLKDMCVDGAIQNFTSQLQHYGEGTSD-RRYNMTEALLFLSHFMGDIHQPMH 160

Query: 123 VGFTSDRGGNTIDVHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDAIQQNITTD-W 181
           VGFTSD GGNTID+ WY  K  LHHVWD  II TA +  Y+ N+D L + +++NIT   W
Sbjct: 161 VGFTSDEGGNTIDLRWYKHKSNLHHVWDREIILTALKENYDKNLDLLQEDLEKNITNGLW 220

Query: 182 ADLVKKWETCSANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLA 241
            D +  W  C+ +  ACP  YASE IK AC W YKGV  G  L +EYFN+RLPIV  R+ 
Sbjct: 221 HDDLSSWTECN-DLIACPHKYASESIKLACKWGYKGVKSGETLSEEYFNTRLPIVMKRIV 279

Query: 242 QGGVRLAATLNRIF 255
           QGGVRLA  LNR+F
Sbjct: 280 QGGVRLAMILNRVF 293


>sp|Q8LDW6|ENDO3_ARATH Endonuclease 3 OS=Arabidopsis thaliana GN=ENDO3 PE=2 SV=1
          Length = 294

 Score =  282 bits (721), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 136/253 (53%), Positives = 173/253 (68%), Gaps = 7/253 (2%)

Query: 5   QSRLREAAADAVKQLLPESADNDLGSVCTWADHVK--FHYHWSSALHFIDTPDNLCTYQY 62
           QS   E    AVK+LLPESA+ +L +VC+W D +K    + W+SALHF DTPD  C Y+Y
Sbjct: 38  QSYFEEDTVVAVKKLLPESANGELAAVCSWPDEIKKLPQWRWTSALHFADTPDYKCNYEY 97

Query: 63  NRDCKDEDGVKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLH 122
           +RDC      K  CV GAI NYT QL+S +  S S   YNLTEAL+FLSH++GDIHQPLH
Sbjct: 98  SRDCP-----KDWCVTGAIFNYTNQLMSTSENSQSIVHYNLTEALMFLSHYMGDIHQPLH 152

Query: 123 VGFTSDRGGNTIDVHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDAIQQNITTDWA 182
            GF  D GGN I VHWY ++  LH VWD+ IIE+A E +YNS++  ++  +Q  +   W+
Sbjct: 153 EGFIGDLGGNKIKVHWYNQETNLHRVWDDMIIESALETYYNSSLPRMIHELQAKLKNGWS 212

Query: 183 DLVKKWETCSANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLAQ 242
           + V  WE+C  N TACP+ YASE I  AC +AY+  + G+ L D YF SRLP+V+ RLAQ
Sbjct: 213 NDVPSWESCQLNQTACPNPYASESIDLACKYAYRNATAGTTLGDYYFVSRLPVVEKRLAQ 272

Query: 243 GGVRLAATLNRIF 255
           GG+RLA TLNRIF
Sbjct: 273 GGIRLAGTLNRIF 285


>sp|F4JJL3|ENDO5_ARATH Endonuclease 5 OS=Arabidopsis thaliana GN=ENDO5 PE=2 SV=1
          Length = 296

 Score =  272 bits (695), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 130/244 (53%), Positives = 170/244 (69%), Gaps = 3/244 (1%)

Query: 15  AVKQLLPESADND-LGSVCTWADHVK--FHYHWSSALHFIDTPDNLCTYQYNRDCKDEDG 71
           AVK+LLPES D   L   C+W D +K    + W+S LH+++TP+  C Y+Y RDC D   
Sbjct: 44  AVKKLLPESVDGGGLADFCSWPDEIKKLSQWQWTSTLHYVNTPEYRCNYEYCRDCHDTHK 103

Query: 72  VKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLHVGFTSDRGG 131
            K  CV GAI NYT QL+S +  S +   YNLTEALLFLSH++GD+HQPLH GF  D GG
Sbjct: 104 HKDWCVTGAIFNYTNQLMSASENSQNIVHYNLTEALLFLSHYMGDVHQPLHTGFLGDLGG 163

Query: 132 NTIDVHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDAIQQNITTDWADLVKKWETC 191
           NTI V+WY  K  LHHVWDN II++A E +YNS++  ++ A+Q  +   W++ V  W++C
Sbjct: 164 NTIIVNWYHNKSNLHHVWDNMIIDSALETYYNSSLPHMIQALQAKLKNGWSNDVPSWKSC 223

Query: 192 SANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLAQGGVRLAATL 251
             +  ACP++YASE I  AC +AY+  + G+ L DEYF SRLP+V+ RLAQGG+RLAATL
Sbjct: 224 HFHQKACPNLYASESIDLACKYAYRNATPGTTLGDEYFLSRLPVVEKRLAQGGIRLAATL 283

Query: 252 NRIF 255
           NRIF
Sbjct: 284 NRIF 287


>sp|P24021|NUS1_ASPOR Nuclease S1 OS=Aspergillus oryzae (strain ATCC 42149 / RIB 40)
           GN=nucS PE=1 SV=2
          Length = 287

 Score = 95.5 bits (236), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 86/271 (31%), Positives = 120/271 (44%), Gaps = 43/271 (15%)

Query: 5   QSRLREAAADAVKQLLPESADNDLGSVCTWADHVKF--HYHWSSALHFIDTPDNL---CT 59
           QS +  +     + +L + + + L +V TWAD  K+     +S   HFID  DN    C 
Sbjct: 34  QSFVASSTESFCQNILGDDSTSYLANVATWADTYKYTDAGEFSKPYHFIDAQDNPPQSCG 93

Query: 60  YQYNRDCKDEDGVKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTEALLFLSHFIGDIHQ 119
             Y+RDC    G  G C   AI NYT  LL   + S +        AL F+ H IGDIHQ
Sbjct: 94  VDYDRDC----GSAG-CSISAIQNYTNILLESPNGSEA------LNALKFVVHIIGDIHQ 142

Query: 120 PLHVGFTSDRGGNTIDVHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDAIQQNITT 179
           PLH     + GGN IDV +      LHH+WD N+ E A   +  S      D + + I T
Sbjct: 143 PLH-DENLEAGGNGIDVTYDGETTNLHHIWDTNMPEEAAGGYSLSVAKTYADLLTERIKT 201

Query: 180 --------DWADLVKKWETCS--------ANNTACPDVYASEGIKAACDWAYKGVSEGSV 223
                    W D +   +  S        AN   C  V   +G+      AY   ++   
Sbjct: 202 GTYSSKKDSWTDGIDIKDPVSTSMIWAADANTYVCSTVL-DDGL------AYINSTD--- 251

Query: 224 LEDEYFNSRLPIVKLRLAQGGVRLAATLNRI 254
           L  EY++   P+ +  +A+ G RLAA L+ I
Sbjct: 252 LSGEYYDKSQPVFEELIAKAGYRLAAWLDLI 282


>sp|P24504|NUP3_PENSQ Nuclease PA3 OS=Penicillium sp. PE=1 SV=1
          Length = 270

 Score = 82.4 bits (202), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 80/250 (32%), Positives = 108/250 (43%), Gaps = 54/250 (21%)

Query: 34  WADHVKFHY--HWSSALHFIDTPDNL---CTYQYNRDCKDEDGVKGRCVAGAINNYTTQL 88
           WAD  +      WS++LHFID  DN    C   Y RDC    G  G C   AI NYT ++
Sbjct: 43  WADEYRLTSAGKWSASLHFIDAEDNPPTNCNVDYERDC----GSSG-CSISAIANYTQRV 97

Query: 89  LSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLHVGFTSDRGGNTIDVHWYTRKQVLHHV 148
                + SS S  N  EAL FL HFIGD+ QPLH    +  GGN I+V +      LH  
Sbjct: 98  -----SDSSLSSENHAEALRFLVHFIGDMTQPLHDEAYA-VGGNKINVTFDGYHDNLHSD 151

Query: 149 WD--------------------NNIIETAEERFYNSNIDGLV--DAIQQNITTDWADLVK 186
           WD                      +++  E   Y +   G +  D I + ITT       
Sbjct: 152 WDTYMPQKLIGGHALSDAESWAKTLVQNIESGNYTAQATGWIKGDNISEPITT-----AT 206

Query: 187 KWETCSANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLAQGGVR 246
           +W +  AN   C  V    G  A          +   L   Y++S +  ++L++A+GG R
Sbjct: 207 RWAS-DANALVCT-VVMPHGAAAL---------QTGDLYPTYYDSVIDTIELQIAKGGYR 255

Query: 247 LAATLNRIFG 256
           LA  +N I G
Sbjct: 256 LANWINEIHG 265


>sp|P24289|NUP1_PENCI Nuclease P1 OS=Penicillium citrinum PE=1 SV=1
          Length = 270

 Score = 81.3 bits (199), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 80/250 (32%), Positives = 108/250 (43%), Gaps = 54/250 (21%)

Query: 34  WADHVKFHY--HWSSALHFIDTPDNL---CTYQYNRDCKDEDGVKGRCVAGAINNYTTQL 88
           WAD  +      WS++LHFID  DN    C   Y RDC    G  G C   AI NYT ++
Sbjct: 43  WADEYRLTSAGKWSASLHFIDAEDNPPTNCNVDYERDC----GSSG-CSISAIANYTQRV 97

Query: 89  LSYNSASSSHSEYNLTEALLFLSHFIGDIHQPLHVGFTSDRGGNTIDVHWYTRKQVLHHV 148
                + SS S  N  EAL FL HFIGD+ QPLH    +  GGN I+V +      LH  
Sbjct: 98  -----SDSSLSSENHAEALRFLVHFIGDMTQPLHDEAYA-VGGNKINVTFDGYHDNLHSD 151

Query: 149 WD--------------------NNIIETAEERFYNSNIDGLV--DAIQQNITTDWADLVK 186
           WD                      +++  E   Y +   G +  D I + ITT       
Sbjct: 152 WDTYMPQKLIGGHALSDAESWAKTLVQNIESGNYTAQAIGWIKGDNISEPITT-----AT 206

Query: 187 KWETCSANNTACPDVYASEGIKAACDWAYKGVSEGSVLEDEYFNSRLPIVKLRLAQGGVR 246
           +W +  AN   C  V    G  A          +   L   Y++S +  ++L++A+GG R
Sbjct: 207 RWAS-DANALVCT-VVMPHGAAAL---------QTGDLYPTYYDSVIDTIELQIAKGGYR 255

Query: 247 LAATLNRIFG 256
           LA  +N I G
Sbjct: 256 LANWINEIHG 265


>sp|P33376|PHLD_BACCE Phospholipase C OS=Bacillus cereus GN=cerA PE=3 SV=1
          Length = 283

 Score = 33.1 bits (74), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 37/80 (46%), Gaps = 22/80 (27%)

Query: 47  ALHFIDTPDNLCTY-QYNRDCKDEDGVKGRCVAGAINNYTTQLLSYNSASSSHSEYNLTE 105
           A HF D PDN  TY  Y +  K E G K   +AG                 S+   ++ +
Sbjct: 105 ASHFYD-PDNGKTYIPYAKQAK-ETGAKYFKLAG----------------ESYKNKDMKQ 146

Query: 106 ALLFLS---HFIGDIHQPLH 122
           A  +L    H++GD++QP+H
Sbjct: 147 AFFYLGLSLHYLGDVNQPMH 166


>sp|C5DGT3|BLI1_LACTC Biogenesis of lysosome-related organelles complex 1 subunit BLI1
           OS=Lachancea thermotolerans (strain ATCC 56472 / CBS
           6340 / NRRL Y-8284) GN=BLI1 PE=3 SV=1
          Length = 113

 Score = 32.3 bits (72), Expect = 3.2,   Method: Compositional matrix adjust.
 Identities = 18/70 (25%), Positives = 35/70 (50%), Gaps = 5/70 (7%)

Query: 159 ERFYNSNIDGLVDAIQQNITTDWADLVKKWETCSANNTACPDVYASEGIKAACDWAYKGV 218
           ER    +I+G +D +Q+ +  + A  +  +E+ + ++  C D     G+KA  +     +
Sbjct: 9   ERVLRQHIEGCIDQVQEYVDLESAQAISSFESKAVDHFRCLD-----GLKAQYNTREDKI 63

Query: 219 SEGSVLEDEY 228
           SE   L+ EY
Sbjct: 64  SEFKRLQSEY 73


>sp|O75452|RDH16_HUMAN Retinol dehydrogenase 16 OS=Homo sapiens GN=RDH16 PE=1 SV=2
          Length = 317

 Score = 30.8 bits (68), Expect = 9.8,   Method: Compositional matrix adjust.
 Identities = 21/84 (25%), Positives = 37/84 (44%), Gaps = 6/84 (7%)

Query: 136 VHWYTRKQVLHHVWDNNIIETAEERFYNSNIDGLVDA-----IQQNITTDWADLVKKWET 190
           +HWY  +QVL H+ D  +  T  +  +   +   +DA     +   +T   A+ ++  +T
Sbjct: 15  LHWYRERQVLSHLRDKYVFITGCDSGFGKLLARQLDARGLRVLAACLTEKGAEQLRG-QT 73

Query: 191 CSANNTACPDVYASEGIKAACDWA 214
                T   DV  +E + AA  W 
Sbjct: 74  SDRLETVTLDVTKTESVAAAAQWV 97


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.133    0.422 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 97,246,359
Number of Sequences: 539616
Number of extensions: 4017687
Number of successful extensions: 9075
Number of sequences better than 100.0: 19
Number of HSP's better than 100.0 without gapping: 9
Number of HSP's successfully gapped in prelim test: 10
Number of HSP's that attempted gapping in prelim test: 9044
Number of HSP's gapped (non-prelim): 19
length of query: 256
length of database: 191,569,459
effective HSP length: 115
effective length of query: 141
effective length of database: 129,513,619
effective search space: 18261420279
effective search space used: 18261420279
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.7 bits)