BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 040581
(394 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P13917|7SB1_SOYBN Basic 7S globulin OS=Glycine max GN=BG PE=1 SV=2
Length = 427
Score = 193 bits (491), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 143/417 (34%), Positives = 204/417 (48%), Gaps = 46/417 (11%)
Query: 5 SSQFLLFLCFIFLVNASVNSVAIPVVKDVSTLQYVAKIHHGVSQVPINLVLDLGGPLTWV 64
S FL FL +N V +PV D ST + A + + + +++DL G WV
Sbjct: 14 SCSFLFFLSDSVTPTKPINLVVLPVQNDGSTGLHWANLQKRTPLMQVPVLVDLNGNHLWV 73
Query: 65 DCDSSADVSSSSSRRLIPSQSIQCSRSGK----SPVPGNGSDTTTNTCGVFTQNGISGLV 120
+C+ SS + + S QCSR+ S + NTCG+ + N I+
Sbjct: 74 NCEQQY---SSKTYQAPFCHSTQCSRANTHQCLSCPAASRPGCHKNTCGLMSTNPITQQT 130
Query: 121 TTGDLAEDTIAVR----SELEDPSITAVDQFLFSCAPTFLLQ-GLARGARGMLGLGRAPI 175
G+L ED +A+ S + + V QFLFSCAP+FL+Q GL R +G+ GLG API
Sbjct: 131 GLGELGEDVLAIHATQGSTQQLGPLVTVPQFLFSCAPSFLVQKGLPRNTQGVAGLGHAPI 190
Query: 176 SLPSQLATGIGHQRKFFMCLS---SSNGVVL---------SHHTSTTKLPLMYTPL-IGK 222
SLP+QLA+ G QR+F CLS +S G ++ L +TPL I
Sbjct: 191 SLPNQLASHFGLQRQFTTCLSRYPTSKGAIIFGDAPNNMRQFQNQDIFHDLAFTPLTITL 250
Query: 223 SQDYFINVKSIKING------NPLSVTI----EGLTKLSTIVPYATMESSIYATFAKAFT 272
+Y + V SI+IN N +S TI G T +ST P+ ++ S+Y F + F
Sbjct: 251 QGEYNVRVNSIRINQHSVFPLNKISSTIVGSTSGGTMISTSTPHMVLQQSVYQAFTQVFA 310
Query: 273 KAAAAASRDMSVVAPVAPFSLCFSSKGFNGSAVPVIDFVL-QSEMVKWRFYGSNSMVKVN 331
+ ++ V VAPF LCF+S N A P +D V+ + WR G + MV+
Sbjct: 311 QQLPKQAQ----VKSVAPFGLCFNSNKIN--AYPSVDLVMDKPNGPVWRISGEDLMVQAQ 364
Query: 332 EEVVCLGFLDGGSDLTSSIVLGGFQLEDNVMDFDLGTSMLGFST----LRGTCCSDF 384
V CLG ++GG + I LG QLE+N++ FDL S +GFST G C+D
Sbjct: 365 PGVTCLGVMNGGMQPRAEITLGARQLEENLVVFDLARSRVGFSTSSLHSHGVKCADL 421
>sp|Q8RVH5|7SBG2_SOYBN Basic 7S globulin 2 OS=Glycine max PE=1 SV=1
Length = 433
Score = 189 bits (479), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 139/403 (34%), Positives = 198/403 (49%), Gaps = 53/403 (13%)
Query: 22 VNSVAIPVVKDVSTLQYVAKIHHGVSQVPINLVLDLGGPLTWVDCDSSADVSSSSSRRLI 81
+N + +PV D ST + A + + + +++DL G WV+C+ SS + +
Sbjct: 38 INLLVLPVQNDASTGLHWANLQKRTPLMQVPVLVDLNGNHLWVNCEQHY---SSKTYQAP 94
Query: 82 PSQSIQCSRSGKSPV--------PGNGSDTTTNTCGVFTQNGISGLVTTGDLAEDTIAVR 133
S QCSR+ PG NTCG+ + N I+ G+L +D +A+
Sbjct: 95 FCHSTQCSRANTHQCLSCPAASRPG----CHKNTCGLMSTNPITQQTGLGELGQDVLAIH 150
Query: 134 ----SELEDPSITAVDQFLFSCAPTFLLQ-GLARGARGMLGLGRAPISLPSQLATGIGHQ 188
S + + V QFLFSCAP+FLLQ GL R +G+ GLG APISLP+QLA+ G Q
Sbjct: 151 ATQGSTQQLGPLVTVPQFLFSCAPSFLLQKGLPRNIQGVAGLGHAPISLPNQLASHFGLQ 210
Query: 189 RKFFMCLS---SSNGVVL---------SHHTSTTKLPLMYTPLIGKSQ-DYFINVKSIKI 235
+F CLS +S G ++ H L +TPL Q +Y + V SI+I
Sbjct: 211 HQFTTCLSRYPTSKGALIFGDAPNNMQQFHNQDIFHDLAFTPLTVTPQGEYNVRVSSIRI 270
Query: 236 NG------NPLSVTI---EGLTKLSTIVPYATMESSIYATFAKAFTKAAAAASRDMSVVA 286
N N +S TI G T +ST P+ ++ S+Y +AFT+ A + V
Sbjct: 271 NQHSVFPPNKISSTIVGSSGGTMISTSTPHMVLQQSLY----QAFTQVFAQQLEKQAQVK 326
Query: 287 PVAPFSLCFSSKGFNGSAVPVIDFVL-QSEMVKWRFYGSNSMVKVNEEVVCLGFLDGGSD 345
VAPF LCF+S N A P +D V+ + WR G + MV+ V CLG ++GG
Sbjct: 327 SVAPFGLCFNSNKIN--AYPSVDLVMDKPNGPVWRISGEDLMVQAQPGVTCLGVMNGGMQ 384
Query: 346 LTSSIVLGGFQLEDNVMDFDLGTSMLGFST----LRGTCCSDF 384
+ + LG QLE+ +M FDL S +GFST G C D
Sbjct: 385 PRAEVTLGTRQLEEKLMVFDLARSRVGFSTSSLHSHGVKCGDL 427
>sp|Q766C2|NEP2_NEPGR Aspartic proteinase nepenthesin-2 OS=Nepenthes gracilis GN=nep2
PE=1 SV=1
Length = 438
Score = 55.5 bits (132), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 85/360 (23%), Positives = 141/360 (39%), Gaps = 61/360 (16%)
Query: 37 QYVAKIHHGVSQVPINLVLDLGGPLTWVDCDSSADVSS----------SSSRRLIPSQSI 86
+Y+ + G + ++D G L W C+ S SSS +P +S
Sbjct: 95 EYLMNVAIGTPDSSFSAIMDTGSDLIWTQCEPCTQCFSQPTPIFNPQDSSSFSTLPCESQ 154
Query: 87 QCSRSGKSPVPGNGSDTTTNTCGVFTQNGISGLVTTGDLAEDTIAVRSELEDPSITAVDQ 146
C +P S+T N +T G T G +A +T + ++V
Sbjct: 155 YCQD-----LP---SETCNNNECQYTYGYGDGSTTQGYMATETFTFET-------SSVPN 199
Query: 147 FLFSCAPTFLLQGLARG-ARGMLGLGRAPISLPSQLATGIGHQRKFFMCLS---SSNGVV 202
F C QG +G G++G+G P+SLPSQL G +F C++ SS+
Sbjct: 200 IAFGCGEDN--QGFGQGNGAGLIGMGWGPLSLPSQLGVG-----QFSYCMTSYGSSSPST 252
Query: 203 LSHHTSTTKLPL--MYTPLIGKSQD---YFINVKSIKINGNPLSV-----------TIEG 246
L+ ++ + +P T LI S + Y+I ++ I + G+ L + T
Sbjct: 253 LALGSAASGVPEGSPSTTLIHSSLNPTYYYITLQGITVGGDNLGIPSSTFQLQDDGTGGM 312
Query: 247 LTKLSTIVPYATMESSIYATFAKAFTKAAAAASRDMSVVAPVAPFSLCFSSKGFNGSAVP 306
+ T + Y ++ Y A+AFT + D S S CF +GS V
Sbjct: 313 IIDSGTTLTYLPQDA--YNAVAQAFTDQINLPTVDESSSG----LSTCFQQPS-DGSTVQ 365
Query: 307 VIDFVLQSEMVKWRFYGSNSMVKVNEEVVCLGFLDGGSDLTSSIVLGGFQLEDNVMDFDL 366
V + +Q + N ++ E V+CL G S + G Q ++ + +DL
Sbjct: 366 VPEISMQFDGGVLNLGEQNILISPAEGVICLAM--GSSSQLGISIFGNIQQQETQVLYDL 423
>sp|Q6XBF8|CDR1_ARATH Aspartic proteinase CDR1 OS=Arabidopsis thaliana GN=CDR1 PE=1 SV=1
Length = 437
Score = 51.6 bits (122), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/329 (25%), Positives = 132/329 (40%), Gaps = 44/329 (13%)
Query: 37 QYVAKIHHGVSQVPINLVLDLGGPLTWVDCDSSADVSSSSSRRLIPSQS-----IQCSRS 91
+Y+ + G PI + D G L W C D + P S + CS S
Sbjct: 89 EYLMNVSIGTPPFPIMAIADTGSDLLWTQCAPCDDCYTQVDPLFDPKTSSTYKDVSCSSS 148
Query: 92 GKSPVPGNGS-DTTTNTCGVFTQNGISGLVTTGDLAEDTIAVRSELEDPSITAVDQFLFS 150
+ + S T NTC G + T G++A DT+ + S P + +
Sbjct: 149 QCTALENQASCSTNDNTCSYSLSYGDNSY-TKGNIAVDTLTLGSSDTRP--MQLKNIIIG 205
Query: 151 C----APTFLLQGLARGARGMLGLGRAPISLPSQLATGIGHQRKFFMCLSSSNGVVLSHH 206
C A TF + G++GLG P+SL QL I KF CL + S
Sbjct: 206 CGHNNAGTF-----NKKGSGIVGLGGGPVSLIKQLGDSI--DGKFSYCLVP----LTSKK 254
Query: 207 TSTTKLP-----------LMYTPLIGK-SQD--YFINVKSIKINGNPLSV--TIEGLTKL 250
T+K+ ++ TPLI K SQ+ Y++ +KSI + + + ++
Sbjct: 255 DQTSKINFGTNAIVSGSGVVSTPLIAKASQETFYYLTLKSISVGSKQIQYSGSDSESSEG 314
Query: 251 STIVPYATMESSIYATFAKAFTKAAAAASRDMSVVAPVAPFSLCFSSKGFNGSAVPVIDF 310
+ I+ T + + F A A++ P + SLC+S+ G VPVI
Sbjct: 315 NIIIDSGTTLTLLPTEFYSELEDAVASSIDAEKKQDPQSGLSLCYSATG--DLKVPVITM 372
Query: 311 VLQSEMVKWRFYGSNSMVKVNEEVVCLGF 339
VK SN+ V+V+E++VC F
Sbjct: 373 HFDGADVKLD--SSNAFVQVSEDLVCFAF 399
>sp|Q766C3|NEP1_NEPGR Aspartic proteinase nepenthesin-1 OS=Nepenthes gracilis GN=nep1
PE=1 SV=1
Length = 437
Score = 49.7 bits (117), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 78/361 (21%), Positives = 144/361 (39%), Gaps = 41/361 (11%)
Query: 37 QYVAKIHHGVSQVPINLVLDLGGPLTWVDCDSSADVSSSSSRRLIPSQS-----IQCSRS 91
+Y+ + G P + ++D G L W C + S+ P S + CS
Sbjct: 94 EYLMNLSIGTPAQPFSAIMDTGSDLIWTQCQPCTQCFNQSTPIFNPQGSSSFSTLPCSSQ 153
Query: 92 GKSPVPGNGSDTTTNTCGVFTQNGISGLVTTGDLAEDTIAVRSELEDPSITAVDQFLFSC 151
+ S T +N +T G T G + +T+ S + P+IT F C
Sbjct: 154 LCQAL---SSPTCSNNFCQYTYGYGDGSETQGSMGTETLTFGS-VSIPNIT------FGC 203
Query: 152 APTFLLQGLARG-ARGMLGLGRAPISLPSQLATGIGHQRKFFMCL-----SSSNGVVLSH 205
QG +G G++G+GR P+SLPSQL KF C+ S+ + ++L
Sbjct: 204 GENN--QGFGQGNGAGLVGMGRGPLSLPSQLDV-----TKFSYCMTPIGSSTPSNLLLGS 256
Query: 206 HTSTTKLPLMYTPLIGKSQD---YFI-----NVKSIKINGNPLSVTIEGLTKLSTIVPYA 257
++ T LI SQ Y+I +V S ++ +P + + I+ +
Sbjct: 257 LANSVTAGSPNTTLIQSSQIPTFYYITLNGLSVGSTRLPIDPSAFALNSNNGTGGIIIDS 316
Query: 258 TMESSIYATFAKAFTKAAAAASRDMSVV-APVAPFSLCFSSKGFNGSAVPVIDFVLQSEM 316
+ + A + + ++ VV + F LCF + + S + + FV+ +
Sbjct: 317 GTTLTYFVNNAYQSVRQEFISQINLPVVNGSSSGFDLCFQTPS-DPSNLQIPTFVMHFDG 375
Query: 317 VKWRFYGSNSMVKVNEEVVCLGFLDGGSDLTSSIVLGGFQLEDNVMDFDLGTSMLGFSTL 376
N + + ++CL GS + G Q ++ ++ +D G S++ F++
Sbjct: 376 GDLELPSENYFISPSNGLICLAM---GSSSQGMSIFGNIQQQNMLVVYDTGNSVVSFASA 432
Query: 377 R 377
+
Sbjct: 433 Q 433
>sp|Q9S9K4|ASPL2_ARATH Aspartic proteinase-like protein 2 OS=Arabidopsis thaliana
GN=At1g65240 PE=1 SV=2
Length = 475
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/369 (21%), Positives = 141/369 (38%), Gaps = 52/369 (14%)
Query: 38 YVAKIHHGVSQVPINLVLDLGGPLTWVDCD---------------SSADVSSSSSRRLIP 82
Y KI G ++ +D G + W++C S D+++SS+ + +
Sbjct: 74 YFTKIKLGSPPKEYHVQVDTGSDILWINCKPCPKCPTKTNLNFRLSLFDMNASSTSKKVG 133
Query: 83 SQSIQCS--RSGKSPVPGNG-------SDTTTNTCGVFTQNGISGLVTTGDLAEDTIAVR 133
CS S P G +D +T+ G F ++ ++ TGDL +
Sbjct: 134 CDDDFCSFISQSDSCQPALGCSYHIVYADESTSD-GKFIRDMLTLEQVTGDLKTGPLG-- 190
Query: 134 SELEDPSITAVDQFLFSCAPTFLLQGLARG---ARGMLGLGRAPISLPSQLATGIGHQRK 190
+ +F C Q L G G++G G++ S+ SQLA +R
Sbjct: 191 -----------QEVVFGCGSDQSGQ-LGNGDSAVDGVMGFGQSNTSVLSQLAATGDAKRV 238
Query: 191 FFMCLSSSNGVVLSHHTSTTKLPLMYTPLIGKSQDYFINVKSIKINGNPLSVTIEGLTKL 250
F CL + G + + TP++ Y + + + ++G L + +
Sbjct: 239 FSHCLDNVKGGGIFAVGVVDSPKVKTTPMVPNQMHYNVMLMGMDVDGTSLDLPRSIVRNG 298
Query: 251 STIVPYATMESSIYATFAKAFTKA--AAAASRDMSVVAPVAPFSLCFSSKGFNGSAVPVI 308
TIV T A F K + +R + V CFS A P +
Sbjct: 299 GTIVDSGT----TLAYFPKVLYDSLIETILARQPVKLHIVEETFQCFSFSTNVDEAFPPV 354
Query: 309 DFVLQSEMVKWRFYGSNSMVKVNEEVVCLGFLDGG---SDLTSSIVLGGFQLEDNVMDFD 365
F + + VK Y + + + EE+ C G+ GG + + I+LG L + ++ +D
Sbjct: 355 SFEFE-DSVKLTVYPHDYLFTLEEELYCFGWQAGGLTTDERSEVILLGDLVLSNKLVVYD 413
Query: 366 LGTSMLGFS 374
L ++G++
Sbjct: 414 LDNEVIGWA 422
>sp|Q3EBM5|ASPR1_ARATH Probable aspartic protease At2g35615 OS=Arabidopsis thaliana
GN=At2g35615 PE=3 SV=1
Length = 447
Score = 48.1 bits (113), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/385 (23%), Positives = 151/385 (39%), Gaps = 72/385 (18%)
Query: 37 QYVAKIHHGVSQVPINLVLDLGGPLTWVDCDSSA----------DVSSSSSRRLIPSQSI 86
++ I G + + + D G LTWV C D SS+ + P S
Sbjct: 84 EFFMSITIGTPPIKVFAIADTGSDLTWVQCKPCQQCYKENGPIFDKKKSSTYKSEPCDSR 143
Query: 87 QCSRSGKSPVPGNGSDTTTNTCGVFTQNGISGLVTTGDLAEDTIAVRSELEDPSITAVDQ 146
C + G D + N C G + GD+A +T+++ S P +
Sbjct: 144 NCQALSSTE---RGCDESNNICKYRYSYGDQSF-SKGDVATETVSIDSASGSP--VSFPG 197
Query: 147 FLFSCAPTFLLQGLARGA------RGMLGLGRAPISLPSQLATGIGHQRKFFMCLS---- 196
+F C G G G++GLG +SL SQL + I +KF CLS
Sbjct: 198 TVFGC-------GYNNGGTFDETGSGIIGLGGGHLSLISQLGSSI--SKKFSYCLSHKSA 248
Query: 197 SSNGVVLSHHTSTTKLP--------LMYTPLIGKS--QDYFINVKSIKING--------- 237
++NG + + T +P ++ TPL+ K Y++ +++I +
Sbjct: 249 TTNGTSVI-NLGTNSIPSSLSKDSGVVSTPLVDKEPLTYYYLTLEAISVGKKKIPYTGSS 307
Query: 238 -NPLSVTIEGLTKLSTIVPYAT----MESSIYATFAKAFTKAAAAASRDMSVVAPVAPFS 292
NP I T + I+ T +E+ + F+ A ++ A R V P S
Sbjct: 308 YNPNDDGILSETSGNIIIDSGTTLTLLEAGFFDKFSSAVEESVTGAKR---VSDPQGLLS 364
Query: 293 LCFSSKGFNGSA-VPVIDFVLQSEMVKWRFYGSNSMVKVNEEVVCLGFLDGGSDLTSSIV 351
CF S GSA + + + + R N+ VK++E++VCL + T +
Sbjct: 365 HCFKS----GSAEIGLPEITVHFTGADVRLSPINAFVKLSEDMVCLSMV----PTTEVAI 416
Query: 352 LGGFQLEDNVMDFDLGTSMLGFSTL 376
G F D ++ +DL T + F +
Sbjct: 417 YGNFAQMDFLVGYDLETRTVSFQHM 441
>sp|Q9LZL3|PCS1L_ARATH Aspartic proteinase PCS1 OS=Arabidopsis thaliana GN=PCS1 PE=2 SV=1
Length = 453
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 85/371 (22%), Positives = 143/371 (38%), Gaps = 65/371 (17%)
Query: 51 INLVLDLGGPLTWVDCDSSA--------DVSSSSSRRLIPSQSIQC-SRSGKSPVPGN-G 100
I++V+D G L+W+ C+ S+ D + SSS IP S C +R+ +P +
Sbjct: 86 ISMVIDTGSELSWLRCNRSSNPNPVNNFDPTRSSSYSPIPCSSPTCRTRTRDFLIPASCD 145
Query: 101 SDTTTNTCGVFTQNGISGLVTTGDLAEDTIAVRSELEDPSITAVDQFLFSCAPTFLLQGL 160
SD + + S G+LA + + D ++ +F C +
Sbjct: 146 SDKLCHATLSYADASSS----EGNLAAEIFHFGNSTNDSNL------IFGCMGSVSGSDP 195
Query: 161 ARGAR--GMLGLGRAPISLPSQLATGIGHQRKFFMCLSSSN---GVVLSHHTSTTKL-PL 214
+ G+LG+ R +S SQ+ KF C+S ++ G +L ++ T L PL
Sbjct: 196 EEDTKTTGLLGMNRGSLSFISQMGFP-----KFSYCISGTDDFPGFLLLGDSNFTWLTPL 250
Query: 215 MYTPLIGKSQD--------YFINVKSIKINGN----PLSVTIE-----GLTKLSTIVPYA 257
YTPLI S Y + + IK+NG P SV + G T + + +
Sbjct: 251 NYTPLIRISTPLPYFDRVAYTVQLTGIKVNGKLLPIPKSVLVPDHTGAGQTMVDSGTQFT 310
Query: 258 TMESSIYATFAKAF---TKAAAAASRDMSVVAPVAPFSLCFSSKGFNGSA-----VPVID 309
+ +Y F T D V LC+ + +P +
Sbjct: 311 FLLGPVYTALRSHFLNRTNGILTVYEDPDFVF-QGTMDLCYRISPVRIRSGILHRLPTVS 369
Query: 310 FVLQSEMV----KWRFYGSNSMVKVNEEVVCLGFLDGGSDLT--SSIVLGGFQLEDNVMD 363
V + + + Y + N+ V C F G SDL + V+G ++ ++
Sbjct: 370 LVFEGAEIAVSGQPLLYRVPHLTVGNDSVYCFTF--GNSDLMGMEAYVIGHHHQQNMWIE 427
Query: 364 FDLGTSMLGFS 374
FDL S +G +
Sbjct: 428 FDLQRSRIGLA 438
>sp|A2ZC67|ASP1_ORYSI Aspartic proteinase Asp1 OS=Oryza sativa subsp. indica GN=ASP1 PE=2
SV=2
Length = 410
Score = 40.8 bits (94), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 49/231 (21%), Positives = 92/231 (39%), Gaps = 31/231 (13%)
Query: 166 GMLGLGRAPISLPSQLAT-GIGHQRKFFMCLSSS-NGVVLSHHTSTTKLPLMYTPLIGKS 223
G+LGLGR ++L SQL + G+ + C+SS G + + ++P+ +
Sbjct: 169 GILGLGRGKVTLLSQLKSQGVITKHVLGHCISSKGKGFLFFGDAKVPTSGVTWSPMNREH 228
Query: 224 QDYFINVKSIKINGN-------PLSVTIEGLTKLS--TIVPYATMESSIYATFAK--AFT 272
+ Y +++ N N P+ V + + + PY S + +T +K F
Sbjct: 229 KHYSPRQGTLQFNSNSKPISAAPMEVIFDSGATYTYFALQPYHATLSVVKSTLSKECKFL 288
Query: 273 KAAAAASRDMSV-------VAPVAPFSLCFSSKGFNGSAVPVIDFVLQSEMVKWRFYGSN 325
R ++V + + CF S + F + +
Sbjct: 289 TEVKEKDRALTVCWKGKDKIRTIDEVKKCFRSLS--------LKFADGDKKATLEIPPEH 340
Query: 326 SMVKVNEEVVCLGFLDGGSD---LTSSIVLGGFQLEDNVMDFDLGTSMLGF 373
++ E VCLG LDG + L + ++GG + D ++ +D S+LG+
Sbjct: 341 YLIISQEGHVCLGILDGSKEHPSLAGTNLIGGITMLDQMVIYDSERSLLGW 391
>sp|Q0IU52|ASP1_ORYSJ Aspartic proteinase Asp1 OS=Oryza sativa subsp. japonica GN=ASP1
PE=2 SV=1
Length = 410
Score = 34.7 bits (78), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 49/232 (21%), Positives = 91/232 (39%), Gaps = 35/232 (15%)
Query: 167 MLGLGRAPISLPSQLAT-GIGHQRKFFMCLSSSNGVVLSHHTSTTKLP---LMYTPLIGK 222
+LGL R ++L SQL + G+ + C+SS G L ++P + +TP+ +
Sbjct: 170 ILGLSRGKVTLLSQLKSQGVITKHVLGHCISSKGGGFL--FFGDAQVPTSGVTWTPMNRE 227
Query: 223 SQDY-------FINVKSIKINGNPLSVTIEGLTKLSTIV--PYATMESSIYATFAK--AF 271
+ Y + S I+ P++V + + PY S + +T F
Sbjct: 228 HKYYSPGHGTLHFDSNSKAISAAPMAVIFDSGATYTYFAAQPYQATLSVVKSTLNSECKF 287
Query: 272 TKAAAAASRDMSV-------VAPVAPFSLCFSSKGFNGSAVPVIDFVLQSEMVKWRFYGS 324
R ++V + + CF S ++F +
Sbjct: 288 LTEVTEKDRALTVCWKGKDKIVTIDEVKKCFRSLS--------LEFADGDKKATLEIPPE 339
Query: 325 NSMVKVNEEVVCLGFLDGGSD---LTSSIVLGGFQLEDNVMDFDLGTSMLGF 373
+ ++ E VCLG LDG + L + ++GG + D ++ +D S+LG+
Sbjct: 340 HYLIISQEGHVCLGILDGSKEHLSLAGTNLIGGITMLDQMVIYDSERSLLGW 391
>sp|Q9LS40|ASPG1_ARATH Protein ASPARTIC PROTEASE IN GUARD CELL 1 OS=Arabidopsis thaliana
GN=ASPG1 PE=1 SV=1
Length = 500
Score = 34.7 bits (78), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 33/113 (29%), Positives = 49/113 (43%), Gaps = 10/113 (8%)
Query: 25 VAIPVVKDVS--TLQYVAKIHHGVSQVPINLVLDLGGPLTWVDCDSSADVSSSSSRRLIP 82
+ PVV S + +Y ++I G + LVLD G + W+ C+ AD S P
Sbjct: 147 LTTPVVSGASQGSGEYFSRIGVGTPAKEMYLVLDTGSDVNWIQCEPCADCYQQSDPVFNP 206
Query: 83 S-----QSIQCSRSGKSPVPGNGSDTTTNTCGVFTQNGISGLVTTGDLAEDTI 130
+ +S+ CS S + S +N C G G T G+LA DT+
Sbjct: 207 TSSSTYKSLTCSAPQCSLL--ETSACRSNKCLYQVSYG-DGSFTVGELATDTV 256
>sp|Q21VT1|CITG_RHOFD Probable 2-(5''-triphosphoribosyl)-3'-dephosphocoenzyme-A synthase
OS=Rhodoferax ferrireducens (strain DSM 15236 / ATCC
BAA-621 / T118) GN=citG PE=3 SV=1
Length = 302
Score = 33.9 bits (76), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 18/45 (40%), Positives = 27/45 (60%), Gaps = 6/45 (13%)
Query: 273 KAAAAASRDMSV------VAPVAPFSLCFSSKGFNGSAVPVIDFV 311
+A + A RDM V +A ++P+ F +G +GSAVPV DF+
Sbjct: 45 RANSGAHRDMDVGTFRASIAAISPWFSFFFERGVDGSAVPVEDFL 89
>sp|Q2SSF2|TRUB_MYCCT tRNA pseudouridine synthase B OS=Mycoplasma capricolum subsp.
capricolum (strain California kid / ATCC 27343 / NCTC
10154) GN=truB PE=3 SV=1
Length = 292
Score = 33.5 bits (75), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 22/82 (26%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 22 VNSVAIPVVKDVSTLQYVAKIHHGVS-QVPINLVLDLGGPLTWVDCDSSADVSSSSSRRL 80
+ S+AI + KD++T+ YV +++ +S IN +D+ L+W S D + ++
Sbjct: 176 IRSLAIDICKDLNTIGYVVELNRTLSGNFDINSAIDI-KDLSWKHLTSIYDAVKINDFKV 234
Query: 81 IPSQSIQCSRSGKSPVPGNGSD 102
+ +I + GK V N D
Sbjct: 235 VKYHNILDVKQGKKIVLNNIKD 256
>sp|Q8IU53|CASC2_HUMAN Protein CASC2, isoforms 1/2 OS=Homo sapiens GN=CASC2 PE=2 SV=1
Length = 76
Score = 33.5 bits (75), Expect = 3.2, Method: Composition-based stats.
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 3/68 (4%)
Query: 163 GARGMLGLGRAPISLPSQLATGIGHQ---RKFFMCLSSSNGVVLSHHTSTTKLPLMYTPL 219
G RG++ LG P++ P + T G Q +K G+++ T P +YT L
Sbjct: 3 GTRGLMLLGPGPVAGPRDVGTCRGRQMEIQKHKDNKKLPQGIIIVFRLQTHTTPQIYTQL 62
Query: 220 IGKSQDYF 227
GK + +F
Sbjct: 63 KGKLRKFF 70
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.133 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 136,767,434
Number of Sequences: 539616
Number of extensions: 5477522
Number of successful extensions: 13607
Number of sequences better than 100.0: 48
Number of HSP's better than 100.0 without gapping: 7
Number of HSP's successfully gapped in prelim test: 41
Number of HSP's that attempted gapping in prelim test: 13564
Number of HSP's gapped (non-prelim): 54
length of query: 394
length of database: 191,569,459
effective HSP length: 119
effective length of query: 275
effective length of database: 127,355,155
effective search space: 35022667625
effective search space used: 35022667625
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 62 (28.5 bits)