BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 033423
(119 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224130730|ref|XP_002320913.1| predicted protein [Populus trichocarpa]
gi|222861686|gb|EEE99228.1| predicted protein [Populus trichocarpa]
Length = 95
Score = 131 bits (330), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 62/95 (65%), Positives = 69/95 (72%)
Query: 25 QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
Q V + CA P CPINCFR DPVCG DGVTYWCGC +A+C+G +VAKLG C
Sbjct: 1 QEVTNKDGKGEACAGLKAPASCPINCFRADPVCGFDGVTYWCGCADAMCSGTRVAKLGAC 60
Query: 85 EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
EVGSGGSA LPGQALLL+HIVWL L+ SLL G F
Sbjct: 61 EVGSGGSASLPGQALLLIHIVWLILIGFSLLFGFF 95
>gi|356553023|ref|XP_003544858.1| PREDICTED: uncharacterized protein LOC100777832 [Glycine max]
Length = 124
Score = 131 bits (329), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 62/87 (71%), Positives = 70/87 (80%), Gaps = 1/87 (1%)
Query: 33 SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
SD +C + T P CP CFR DPVCG DGVTYWCGC A CAGV+VAKLGFCEVG+GGSA
Sbjct: 39 SDSLCGK-TTPSSCPAKCFRADPVCGADGVTYWCGCAEAACAGVEVAKLGFCEVGNGGSA 97
Query: 93 PLPGQALLLVHIVWLFLLALSLLLGVF 119
P+PGQALLLVHIVWL +L S+L G+F
Sbjct: 98 PIPGQALLLVHIVWLIVLGFSVLFGLF 124
>gi|449476421|ref|XP_004154732.1| PREDICTED: uncharacterized protein LOC101228097 [Cucumis sativus]
Length = 130
Score = 130 bits (327), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 57/79 (72%), Positives = 69/79 (87%)
Query: 41 TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
+ P CP+ CFR DPVCGVDGVTYWCGC +ALC+GVKVAK+GFCEVG+GGSAP+P QALL
Sbjct: 52 SAPSYCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKVAKMGFCEVGNGGSAPIPAQALL 111
Query: 101 LVHIVWLFLLALSLLLGVF 119
LVHI+WL +L +S+L G+F
Sbjct: 112 LVHILWLIILGVSVLFGLF 130
>gi|449445981|ref|XP_004140750.1| PREDICTED: uncharacterized protein LOC101210178 [Cucumis sativus]
gi|449522135|ref|XP_004168083.1| PREDICTED: uncharacterized LOC101210178 [Cucumis sativus]
Length = 130
Score = 130 bits (327), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 58/77 (75%), Positives = 69/77 (89%)
Query: 43 PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLV 102
P CP+ CFR DPVCGVDGVTYWCGC +ALC+GVKVAK+GFCEVG+GGSAP+PGQALLLV
Sbjct: 54 PSSCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKVAKMGFCEVGNGGSAPIPGQALLLV 113
Query: 103 HIVWLFLLALSLLLGVF 119
HI+WL +L +S+L G+F
Sbjct: 114 HILWLIILGVSVLFGLF 130
>gi|357464293|ref|XP_003602428.1| hypothetical protein MTR_3g093240 [Medicago truncatula]
gi|355491476|gb|AES72679.1| hypothetical protein MTR_3g093240 [Medicago truncatula]
Length = 133
Score = 130 bits (326), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 60/92 (65%), Positives = 67/92 (72%)
Query: 28 QSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVG 87
+ T E VCA CP+ CFR DPVCGVDGVTYWCGC A CAGVKV K+GFCEVG
Sbjct: 42 KVTAEGKNVCAGAVASSWCPVKCFRTDPVCGVDGVTYWCGCAEAACAGVKVGKMGFCEVG 101
Query: 88 SGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
SGGSAPL QA LL+HIVWL +LA S+ G+F
Sbjct: 102 SGGSAPLSAQAFLLLHIVWLIVLAFSVFFGLF 133
>gi|356500817|ref|XP_003519227.1| PREDICTED: uncharacterized protein LOC100780930 [Glycine max]
Length = 125
Score = 125 bits (315), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 60/87 (68%), Positives = 69/87 (79%), Gaps = 1/87 (1%)
Query: 33 SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
S+ +C + T P CP CFR DPVCG DGVTYWCGC A CAGV+VAK GFCEVG+GGSA
Sbjct: 40 SESLCGK-TMPLSCPAKCFRADPVCGADGVTYWCGCAEAACAGVEVAKFGFCEVGNGGSA 98
Query: 93 PLPGQALLLVHIVWLFLLALSLLLGVF 119
P+PGQALLLVHIVWL +L S+L G+F
Sbjct: 99 PIPGQALLLVHIVWLIVLGFSVLFGLF 125
>gi|115440619|ref|NP_001044589.1| Os01g0811200 [Oryza sativa Japonica Group]
gi|18844762|dbj|BAB85233.1| serine protease inhibitor-like protein [Oryza sativa Japonica
Group]
gi|20160706|dbj|BAB89649.1| serine protease inhibitor-like protein [Oryza sativa Japonica
Group]
gi|113534120|dbj|BAF06503.1| Os01g0811200 [Oryza sativa Japonica Group]
Length = 128
Score = 120 bits (302), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 54/77 (70%), Positives = 66/77 (85%), Gaps = 1/77 (1%)
Query: 41 TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
++ +LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALL
Sbjct: 51 SEAQLCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALL 109
Query: 101 LVHIVWLFLLALSLLLG 117
LVHIVWLF+L ++LLG
Sbjct: 110 LVHIVWLFVLGAAVLLG 126
>gi|413952150|gb|AFW84799.1| serine-type endopeptidase inhibitor [Zea mays]
Length = 131
Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 55/74 (74%), Positives = 64/74 (86%), Gaps = 1/74 (1%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG-GSAPLPGQALLLVH 103
LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+G GSAP+ GQALLLVH
Sbjct: 56 LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGAGSGSAPVSGQALLLVH 115
Query: 104 IVWLFLLALSLLLG 117
IVWLF+L ++LLG
Sbjct: 116 IVWLFVLGAAVLLG 129
>gi|449438983|ref|XP_004137267.1| PREDICTED: uncharacterized protein LOC101220346 [Cucumis sativus]
Length = 129
Score = 120 bits (301), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 55/79 (69%), Positives = 68/79 (86%), Gaps = 1/79 (1%)
Query: 41 TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
+ P CP+ CFR DPVCGVDGVTYWCGC +ALC+GVKVAK+GFCEVG+GGSAP+P QALL
Sbjct: 52 SAPSYCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKVAKMGFCEVGNGGSAPIPAQALL 111
Query: 101 LVHIVWLFLLALSLLLGVF 119
LVHI+ + +L +S+L G+F
Sbjct: 112 LVHILPI-ILGVSVLFGLF 129
>gi|226505326|ref|NP_001149593.1| serine-type endopeptidase inhibitor precursor [Zea mays]
gi|195628284|gb|ACG35972.1| serine-type endopeptidase inhibitor [Zea mays]
Length = 131
Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 55/74 (74%), Positives = 64/74 (86%), Gaps = 1/74 (1%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG-GSAPLPGQALLLVH 103
LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+G GSAP+ GQALLLVH
Sbjct: 56 LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGAGSGSAPVSGQALLLVH 115
Query: 104 IVWLFLLALSLLLG 117
IVWLF+L ++LLG
Sbjct: 116 IVWLFVLGAAVLLG 129
>gi|414880042|tpg|DAA57173.1| TPA: serine-type endopeptidase inhibitor [Zea mays]
Length = 227
Score = 120 bits (300), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 54/73 (73%), Positives = 63/73 (86%), Gaps = 1/73 (1%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 154 LCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALLLVHI 212
Query: 105 VWLFLLALSLLLG 117
VWLF+L ++LLG
Sbjct: 213 VWLFVLGAAVLLG 225
>gi|125528114|gb|EAY76228.1| hypothetical protein OsI_04164 [Oryza sativa Indica Group]
Length = 132
Score = 119 bits (299), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 54/77 (70%), Positives = 66/77 (85%), Gaps = 1/77 (1%)
Query: 41 TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
++ +LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALL
Sbjct: 55 SEAQLCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALL 113
Query: 101 LVHIVWLFLLALSLLLG 117
LVHIVWLF+L ++LLG
Sbjct: 114 LVHIVWLFVLGAAVLLG 130
>gi|224064390|ref|XP_002301452.1| predicted protein [Populus trichocarpa]
gi|222843178|gb|EEE80725.1| predicted protein [Populus trichocarpa]
Length = 95
Score = 118 bits (295), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/95 (67%), Positives = 69/95 (72%), Gaps = 3/95 (3%)
Query: 28 QSTQESDG---VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
Q DG CA P CPINCFR DPVCGVDGVTYWCGC +ALC+G +V KLG C
Sbjct: 1 QEVTREDGKGDACAGLKAPASCPINCFRADPVCGVDGVTYWCGCADALCSGTRVDKLGAC 60
Query: 85 EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
EVGSGGS+ LPGQALLL+HIVWL LL SLL G F
Sbjct: 61 EVGSGGSSSLPGQALLLIHIVWLILLGFSLLFGFF 95
>gi|357125538|ref|XP_003564450.1| PREDICTED: uncharacterized protein LOC100825392 [Brachypodium
distachyon]
Length = 125
Score = 117 bits (293), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 54/73 (73%), Positives = 64/73 (87%), Gaps = 1/73 (1%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
LCP+ CFRPDPVCG DGVTYWCGCP A+CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 52 LCPVRCFRPDPVCGADGVTYWCGCPEAVCAGARVARRGYCEVGA-GSAPVSGQALLLVHI 110
Query: 105 VWLFLLALSLLLG 117
VWLF+L ++LLG
Sbjct: 111 VWLFVLGAAVLLG 123
>gi|226510470|ref|NP_001149605.1| serine-type endopeptidase inhibitor precursor [Zea mays]
gi|195628428|gb|ACG36044.1| serine-type endopeptidase inhibitor [Zea mays]
Length = 129
Score = 117 bits (293), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/73 (73%), Positives = 63/73 (86%), Gaps = 1/73 (1%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 56 LCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALLLVHI 114
Query: 105 VWLFLLALSLLLG 117
VWLF+L ++LLG
Sbjct: 115 VWLFVLGAAVLLG 127
>gi|242054737|ref|XP_002456514.1| hypothetical protein SORBIDRAFT_03g037650 [Sorghum bicolor]
gi|241928489|gb|EES01634.1| hypothetical protein SORBIDRAFT_03g037650 [Sorghum bicolor]
Length = 131
Score = 117 bits (292), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 54/73 (73%), Positives = 63/73 (86%), Gaps = 1/73 (1%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 58 LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGA-GSAPVSGQALLLVHI 116
Query: 105 VWLFLLALSLLLG 117
VWLF+L ++LLG
Sbjct: 117 VWLFVLGAAVLLG 129
>gi|18411677|ref|NP_567214.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
thaliana]
gi|332656644|gb|AEE82044.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
thaliana]
Length = 144
Score = 116 bits (290), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 51/95 (53%), Positives = 64/95 (67%)
Query: 25 QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
+ + + C KP CP+ CFRPDPVCG D VTYWCGC +ALC GV+V K G C
Sbjct: 50 EKINGEKNRGEFCEGIAKPASCPVQCFRPDPVCGEDSVTYWCGCADALCHGVRVVKQGAC 109
Query: 85 EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
+VG+G +PGQALLL+HIVW+ LL S+L G+F
Sbjct: 110 DVGNGVGLSVPGQALLLIHIVWMMLLGFSILFGLF 144
>gi|21555759|gb|AAM63928.1| unknown [Arabidopsis thaliana]
Length = 135
Score = 115 bits (289), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 51/95 (53%), Positives = 64/95 (67%)
Query: 25 QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
+ + + C KP CP+ CFRPDPVCG D VTYWCGC +ALC GV+V K G C
Sbjct: 41 EKINGEKNRGEFCEGIAKPASCPVQCFRPDPVCGEDSVTYWCGCADALCHGVRVVKQGAC 100
Query: 85 EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
+VG+G +PGQALLL+HIVW+ LL S+L G+F
Sbjct: 101 DVGNGVGLSVPGQALLLIHIVWMMLLGFSILFGLF 135
>gi|297810001|ref|XP_002872884.1| hypothetical protein ARALYDRAFT_490419 [Arabidopsis lyrata subsp.
lyrata]
gi|297318721|gb|EFH49143.1| hypothetical protein ARALYDRAFT_490419 [Arabidopsis lyrata subsp.
lyrata]
Length = 137
Score = 115 bits (289), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 51/95 (53%), Positives = 64/95 (67%)
Query: 25 QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
+ + + C KP CP+ CFRPDPVCG D VTYWCGC +ALC GV+V K G C
Sbjct: 43 EKIDGEKNRGEFCEGIAKPASCPVQCFRPDPVCGEDSVTYWCGCADALCHGVRVVKQGAC 102
Query: 85 EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
+VG+G +PGQALLL+HIVW+ LL S+L G+F
Sbjct: 103 DVGNGVGLSVPGQALLLIHIVWMMLLGFSILFGLF 137
>gi|255548137|ref|XP_002515125.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
gi|223545605|gb|EEF47109.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
Length = 130
Score = 115 bits (287), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 55/87 (63%), Positives = 66/87 (75%), Gaps = 1/87 (1%)
Query: 23 GLQLVQSTQESDG-VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKL 81
G+ + Q T + +G VC T P CPINCFR DPVCGVDG TYWCGC +ALC+G VAK+
Sbjct: 33 GMLIEQITSKGNGDVCKGVTAPASCPINCFRADPVCGVDGRTYWCGCDDALCSGTTVAKI 92
Query: 82 GFCEVGSGGSAPLPGQALLLVHIVWLF 108
G C++G+GGSA LP QALLLVHI+WL
Sbjct: 93 GACDMGNGGSASLPRQALLLVHILWLI 119
>gi|356507538|ref|XP_003522521.1| PREDICTED: uncharacterized protein LOC100797240 [Glycine max]
Length = 131
Score = 114 bits (285), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 53/88 (60%), Positives = 62/88 (70%), Gaps = 1/88 (1%)
Query: 32 ESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGS 91
E +C+ P CP+ CFR DPVC VDGVTYWCGC A A ++AKLGFCEVG+GGS
Sbjct: 45 EGKNLCSA-AAPSSCPVKCFRTDPVCSVDGVTYWCGCSEAAYASAQIAKLGFCEVGNGGS 103
Query: 92 APLPGQALLLVHIVWLFLLALSLLLGVF 119
L GQALLLVHIVWL +L S+ G+F
Sbjct: 104 VTLSGQALLLVHIVWLIVLGFSVFFGLF 131
>gi|224132550|ref|XP_002321347.1| predicted protein [Populus trichocarpa]
gi|222868343|gb|EEF05474.1| predicted protein [Populus trichocarpa]
Length = 132
Score = 112 bits (280), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 54/85 (63%), Positives = 63/85 (74%), Gaps = 3/85 (3%)
Query: 35 GVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPL 94
G +R + CP+ CFRPDPVCGVDGVTYWCGC +A CAG KV K GFCEVG+ G+A
Sbjct: 51 GEWSRRSPVSSCPVKCFRPDPVCGVDGVTYWCGCDDARCAGTKVTKKGFCEVGNNGAA-- 108
Query: 95 PGQALLLVHIVWLFLLALSLLLGVF 119
QALLLVHIVWL +L S+L G+F
Sbjct: 109 -AQALLLVHIVWLIVLGFSILFGLF 132
>gi|297821082|ref|XP_002878424.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
lyrata subsp. lyrata]
gi|297324262|gb|EFH54683.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
lyrata subsp. lyrata]
Length = 117
Score = 111 bits (277), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 53/95 (55%), Positives = 64/95 (67%), Gaps = 2/95 (2%)
Query: 14 LIFFLALNIGLQLVQSTQESDG--VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNA 71
+F + IGLQ ++ G VC R + CPINCFR DPVCG DGVTYWCGCP+A
Sbjct: 10 FLFLVLCLIGLQAADDFLDNSGGDVCPRVSDSGGCPINCFRADPVCGADGVTYWCGCPDA 69
Query: 72 LCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHIVW 106
C G +V K G C+ G+ GSA +PGQALLL+HIVW
Sbjct: 70 ACHGARVVKRGACDTGNAGSASVPGQALLLIHIVW 104
>gi|225455171|ref|XP_002268733.1| PREDICTED: uncharacterized protein LOC100253810 [Vitis vinifera]
Length = 128
Score = 109 bits (272), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 55/89 (61%), Positives = 62/89 (69%), Gaps = 5/89 (5%)
Query: 32 ESDG-VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGG 90
ESDG +C CP+ CFRPDPVCGVDGVTYWCGC A CAG VA G CEVG+G
Sbjct: 44 ESDGGLCGGSEVAASCPVKCFRPDPVCGVDGVTYWCGCTEARCAGTAVAHFGICEVGNGS 103
Query: 91 SAPLPGQALLLVHIVWLFLLALSLLLGVF 119
+ GQALLLVHIVWL +L S+L G+F
Sbjct: 104 A----GQALLLVHIVWLMVLGFSVLFGLF 128
>gi|116784747|gb|ABK23457.1| unknown [Picea sitchensis]
Length = 131
Score = 107 bits (268), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 56/83 (67%), Positives = 67/83 (80%)
Query: 36 VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLP 95
VC+ + CP++CFRPDPVCGVDG+TYWCGC +A CAGV VAK+GFCEVG+GGS L
Sbjct: 48 VCSEKNESASCPVSCFRPDPVCGVDGITYWCGCTDAQCAGVDVAKIGFCEVGNGGSGLLS 107
Query: 96 GQALLLVHIVWLFLLALSLLLGV 118
G+ALLLVHIVWL LL +L+GV
Sbjct: 108 GEALLLVHIVWLILLGFIVLIGV 130
>gi|357490411|ref|XP_003615493.1| hypothetical protein MTR_5g068760 [Medicago truncatula]
gi|355516828|gb|AES98451.1| hypothetical protein MTR_5g068760 [Medicago truncatula]
gi|388491276|gb|AFK33704.1| unknown [Medicago truncatula]
gi|388497980|gb|AFK37056.1| unknown [Medicago truncatula]
Length = 120
Score = 107 bits (266), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 61/87 (70%), Positives = 65/87 (74%), Gaps = 1/87 (1%)
Query: 33 SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
S VC+ T P CP CFR DPVCG DGVTYWCGC A CAG KVAKLGFCEVG+GGSA
Sbjct: 35 SQNVCS-VTTPSSCPAKCFRTDPVCGADGVTYWCGCAEAACAGAKVAKLGFCEVGNGGSA 93
Query: 93 PLPGQALLLVHIVWLFLLALSLLLGVF 119
PGQALLLVHIVWL +L S+L G F
Sbjct: 94 TFPGQALLLVHIVWLIVLGFSVLFGFF 120
>gi|21554106|gb|AAM63186.1| unknown [Arabidopsis thaliana]
Length = 117
Score = 106 bits (265), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 3/99 (3%)
Query: 11 IAVLIFFLALN-IGLQLVQS--TQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCG 67
+++ FL L IGLQ + VC R C INCFR DPVCG DGVTYWCG
Sbjct: 6 LSIRFLFLVLCLIGLQAADDFPDKSRGDVCPRVKDRGGCTINCFRADPVCGTDGVTYWCG 65
Query: 68 CPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHIVW 106
CP+A C G +V K G C+ G+ GSA +PGQALLL+HIVW
Sbjct: 66 CPDAACHGTRVVKKGACDTGNAGSASVPGQALLLIHIVW 104
>gi|18412211|ref|NP_567123.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
thaliana]
gi|6899896|emb|CAB71905.1| hypothetical protein [Arabidopsis thaliana]
gi|17473699|gb|AAL38306.1| unknown protein [Arabidopsis thaliana]
gi|21386987|gb|AAM47897.1| unknown protein [Arabidopsis thaliana]
gi|332646768|gb|AEE80289.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
thaliana]
Length = 117
Score = 106 bits (264), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 3/99 (3%)
Query: 11 IAVLIFFLALN-IGLQLVQS--TQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCG 67
+++ FL L IGLQ + VC R C INCFR DPVCG DGVTYWCG
Sbjct: 6 LSIRFLFLVLCLIGLQAADDFPDKSRGDVCPRVKDRGGCTINCFRADPVCGTDGVTYWCG 65
Query: 68 CPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHIVW 106
CP+A C G +V K G C+ G+ GSA +PGQALLL+HIVW
Sbjct: 66 CPDAACHGARVVKKGACDTGNAGSASVPGQALLLIHIVW 104
>gi|255557162|ref|XP_002519612.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
gi|223541202|gb|EEF42757.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
Length = 134
Score = 103 bits (257), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 53/100 (53%), Positives = 70/100 (70%), Gaps = 3/100 (3%)
Query: 20 LNIGLQLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVA 79
+ + ++ + ++ C + +P CP+NCFR DPVCGVDGVTYWCGC +A CAG KVA
Sbjct: 38 IRLPTHIIDNQDDNVDPCGEYGRPVSCPVNCFRTDPVCGVDGVTYWCGCRDAWCAGTKVA 97
Query: 80 KLGFCEVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
K GFCEVG+ G+A QALLL+HIVWL +L +L G+F
Sbjct: 98 KKGFCEVGNNGAA---AQALLLLHIVWLIVLGFCVLFGIF 134
>gi|414880449|tpg|DAA57580.1| TPA: hypothetical protein ZEAMMB73_385931 [Zea mays]
Length = 384
Score = 90.5 bits (223), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 35/45 (77%), Positives = 39/45 (86%)
Query: 45 LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG 89
LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVGSG
Sbjct: 279 LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGSG 323
>gi|168026830|ref|XP_001765934.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162682840|gb|EDQ69255.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 167
Score = 85.9 bits (211), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 42/84 (50%), Positives = 55/84 (65%), Gaps = 4/84 (4%)
Query: 40 FTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPG--- 96
K LC +NCFR DPVCG DGVTYWCG A CA V+VA G+C++ +GG+ G
Sbjct: 84 IEKEPLCTVNCFRADPVCGSDGVTYWCGVAEAKCAKVEVAHDGYCDIWNGGTKANLGLHA 143
Query: 97 -QALLLVHIVWLFLLALSLLLGVF 119
Q+L LVH+VWL L L +++G+
Sbjct: 144 AQSLQLVHMVWLVLAGLLIVIGLL 167
>gi|125572387|gb|EAZ13902.1| hypothetical protein OsJ_03827 [Oryza sativa Japonica Group]
Length = 129
Score = 78.6 bits (192), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 2/78 (2%)
Query: 41 TKPELCPINCFRPDPVCGV-DGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQAL 99
++ +LCP+ CFR G G P CAG +VA+ G+CEVG+G SAP+ GQAL
Sbjct: 51 SEAQLCPVRCFRAGTRSAAPTGSRTGAGAPRPACAGARVARRGYCEVGAG-SAPVSGQAL 109
Query: 100 LLVHIVWLFLLALSLLLG 117
LLVHIVWLF+L ++LLG
Sbjct: 110 LLVHIVWLFVLGAAVLLG 127
>gi|255548139|ref|XP_002515126.1| hypothetical protein RCOM_1341480 [Ricinus communis]
gi|223545606|gb|EEF47110.1| hypothetical protein RCOM_1341480 [Ricinus communis]
Length = 83
Score = 74.7 bits (182), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 39/84 (46%), Positives = 51/84 (60%), Gaps = 4/84 (4%)
Query: 3 TSSLANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPELC--PINCFRPDPVCGVD 60
T+S R + +I +A + L+ + ++ +C P C PINCFRPDPVCG +
Sbjct: 2 TTSFKKRTLLCMIVVVATCV--LLLTNGVQAQNLCPGSEPPPSCAGPINCFRPDPVCGAN 59
Query: 61 GVTYWCGCPNALCAGVKVAKLGFC 84
GVTYWCGCP+A CAGV V KL C
Sbjct: 60 GVTYWCGCPDAACAGVPVVKLEAC 83
>gi|255548141|ref|XP_002515127.1| hypothetical protein RCOM_1341590 [Ricinus communis]
gi|223545607|gb|EEF47111.1| hypothetical protein RCOM_1341590 [Ricinus communis]
Length = 81
Score = 73.6 bits (179), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 39/77 (50%), Positives = 48/77 (62%), Gaps = 1/77 (1%)
Query: 9 RLIAVLIFFLALNIGLQLVQSTQESDGVC-ARFTKPELCPINCFRPDPVCGVDGVTYWCG 67
+ IA L L + I V +Q + +C + PE CPI C PDPVCG +GVTYWCG
Sbjct: 5 KNIAPLCMMLLVFIVCVSVAQSQITINLCPGPMSPPEGCPIACLVPDPVCGANGVTYWCG 64
Query: 68 CPNALCAGVKVAKLGFC 84
CP+ALCAGV+V K G C
Sbjct: 65 CPDALCAGVRVVKFGEC 81
>gi|168024312|ref|XP_001764680.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683974|gb|EDQ70379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 117
Score = 70.5 bits (171), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 39/102 (38%), Positives = 55/102 (53%), Gaps = 6/102 (5%)
Query: 24 LQLVQSTQESDGVCARFTKP------ELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVK 77
++ V S +DG +P + CP+ C R DPVCGVD VTYWCG +A CAGV+
Sbjct: 10 IRSVVSKAAADGANNVLIEPCILQTTKSCPVKCVRTDPVCGVDKVTYWCGAADAKCAGVE 69
Query: 78 VAKLGFCEVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
VA G C + G+A AL +H + L + ++ G+F
Sbjct: 70 VAHDGSCNLWELGTATSSTTALYALHSLQLNRMFCLVIAGLF 111
>gi|168050727|ref|XP_001777809.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162670785|gb|EDQ57347.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 190
Score = 69.7 bits (169), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 47/81 (58%), Gaps = 7/81 (8%)
Query: 33 SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG--- 89
S G C R T CP+ CFR PVCG D VTY CG +A CAGV+VA GFC + G
Sbjct: 87 SKGPCIRQTTMS-CPVKCFRSYPVCGTDKVTYRCGAADANCAGVEVAYDGFCNLWEGDRN 145
Query: 90 ---GSAPLPGQALLLVHIVWL 107
+A Q+L LVH++WL
Sbjct: 146 VGSSTALYAVQSLQLVHMLWL 166
>gi|302785325|ref|XP_002974434.1| hypothetical protein SELMODRAFT_414607 [Selaginella
moellendorffii]
gi|300158032|gb|EFJ24656.1| hypothetical protein SELMODRAFT_414607 [Selaginella
moellendorffii]
Length = 128
Score = 65.5 bits (158), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 25/46 (54%), Positives = 35/46 (76%)
Query: 46 CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGS 91
CP++C R DPVCG +GVTYWCG +A C GV+++ G+C G+GG+
Sbjct: 50 CPVSCLRADPVCGANGVTYWCGSRDAACDGVEISHTGYCSPGNGGA 95
>gi|224130742|ref|XP_002320916.1| predicted protein [Populus trichocarpa]
gi|222861689|gb|EEE99231.1| predicted protein [Populus trichocarpa]
Length = 81
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 38/83 (45%), Positives = 47/83 (56%), Gaps = 6/83 (7%)
Query: 6 LANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPEL----CPINCFRPDPVCGVDG 61
+A I + + L GL L ++ +S VCA +P+ C INCF PDPVCG DG
Sbjct: 1 MATSKIVAPLCLMVLVFGLCLPKA--QSQDVCAGVERPDPETIPCTINCFVPDPVCGTDG 58
Query: 62 VTYWCGCPNALCAGVKVAKLGFC 84
VTY CGC +A C GV V K G C
Sbjct: 59 VTYSCGCLDAFCHGVDVVKEGEC 81
>gi|224130738|ref|XP_002320915.1| predicted protein [Populus trichocarpa]
gi|222861688|gb|EEE99230.1| predicted protein [Populus trichocarpa]
Length = 83
Score = 65.1 bits (157), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 42/89 (47%), Positives = 54/89 (60%), Gaps = 11/89 (12%)
Query: 1 MRTSSLANRL-IAVLIFFLALNIGLQLVQSTQESDGVCARFTKPEL----CPINCFRPDP 55
M TS + L + VL+F L+L+ +V+S +S GVCA +P+ C INC DP
Sbjct: 1 MATSKIVAPLCLMVLVFCLSLS----MVKS--QSYGVCAGAARPDPETIPCTINCLVADP 54
Query: 56 VCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
VCG DGVTY CGC +A C GV+V K G C
Sbjct: 55 VCGTDGVTYTCGCYDAFCHGVEVVKKGEC 83
>gi|224130734|ref|XP_002320914.1| predicted protein [Populus trichocarpa]
gi|222861687|gb|EEE99229.1| predicted protein [Populus trichocarpa]
Length = 80
Score = 58.9 bits (141), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 42/81 (51%), Gaps = 3/81 (3%)
Query: 6 LANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPEL--CPINCFRPDPVCGVDGVT 63
+A I + + L GL L +S GVC F PE C + C PD VCG DGVT
Sbjct: 1 MATSKILAPLCLMVLVFGLCLSMVESQSYGVCEGF-DPEAPRCAVRCSVPDYVCGTDGVT 59
Query: 64 YWCGCPNALCAGVKVAKLGFC 84
Y CGC +A C GV V K G C
Sbjct: 60 YTCGCKDAFCNGVDVVKKGKC 80
>gi|255548143|ref|XP_002515128.1| conserved hypothetical protein [Ricinus communis]
gi|223545608|gb|EEF47112.1| conserved hypothetical protein [Ricinus communis]
Length = 75
Score = 55.1 bits (131), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 43/75 (57%), Gaps = 4/75 (5%)
Query: 14 LIFFLALNIGLQLVQSTQESDGVC--ARFTKP--ELCPINCFRPDPVCGVDGVTYWCGCP 69
++ + L + L +V+S + +C A+ P + I C R DPVCGV+GVTY CGCP
Sbjct: 1 MMMVVVLGVCLTMVESKVQVPNLCDLAKSASPGKDCSAIRCLRYDPVCGVNGVTYGCGCP 60
Query: 70 NALCAGVKVAKLGFC 84
A V+V KLG C
Sbjct: 61 EANFYAVRVGKLGAC 75
>gi|407462333|ref|YP_006773650.1| hypothetical protein NKOR_04090 [Candidatus Nitrosopumilus
koreensis AR1]
gi|407045955|gb|AFS80708.1| hypothetical protein NKOR_04090 [Candidatus Nitrosopumilus
koreensis AR1]
Length = 250
Score = 51.6 bits (122), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/51 (50%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 42 KPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
+PE C I C DPVCGVDG+TY CG +A C GVKV G C G+
Sbjct: 201 EPE-CSIQCLVYDPVCGVDGITYACGIEDAACHGVKVKHDGECSDSKLGTE 250
>gi|407465325|ref|YP_006776207.1| proteinase inhibitor I1 Kazal [Candidatus Nitrosopumilus sp. AR2]
gi|407048513|gb|AFS83265.1| proteinase inhibitor I1 Kazal [Candidatus Nitrosopumilus sp. AR2]
Length = 249
Score = 48.9 bits (115), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 27/56 (48%), Gaps = 2/56 (3%)
Query: 35 GVCARFTKPE--LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGS 88
+ F + E C I C DPVCG D +TY CG +A C VKV G C S
Sbjct: 114 DIREEFNRNENKECSIQCLVYDPVCGEDNITYACGVEDATCHNVKVKYSGECNASS 169
>gi|168057730|ref|XP_001780866.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667722|gb|EDQ54345.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 169
Score = 44.7 bits (104), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 6/59 (10%)
Query: 65 WCGCPNALCAGVKVAKLGFCEVGS------GGSAPLPGQALLLVHIVWLFLLALSLLLG 117
WCG +A CAGV+VA G+C V +A Q+L LVH++WL + L ++LG
Sbjct: 109 WCGAADAKCAGVEVAHDGYCNVWELDANVGSTTAVRAAQSLQLVHMLWLVVAGLLIVLG 167
>gi|168044603|ref|XP_001774770.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673925|gb|EDQ60441.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 196
Score = 38.5 bits (88), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 33/59 (55%), Gaps = 8/59 (13%)
Query: 66 CGCPNALCAGVKVAKLGFC-------EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLG 117
CG +A CAGV VA G C VGS +A Q+L LVH++WL + L ++LG
Sbjct: 137 CGAADAKCAGVGVAHDGHCNLWELNANVGS-TTAVRAAQSLQLVHMLWLVVAGLLIVLG 194
>gi|302843587|ref|XP_002953335.1| hypothetical protein VOLCADRAFT_105857 [Volvox carteri f.
nagariensis]
gi|300261432|gb|EFJ45645.1| hypothetical protein VOLCADRAFT_105857 [Volvox carteri f.
nagariensis]
Length = 964
Score = 38.5 bits (88), Expect = 0.52, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 24/47 (51%), Gaps = 3/47 (6%)
Query: 49 NCFRPD---PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
C PD PVCG +G TY C CAGV VA LG C V G A
Sbjct: 791 QCNCPDDYQPVCGSNGKTYDNACKAVFCAGVSVAYLGECSVDENGCA 837
>gi|149924837|ref|ZP_01913176.1| hypothetical protein PPSIR1_11716 [Plesiocystis pacifica SIR-1]
gi|149814297|gb|EDM73900.1| hypothetical protein PPSIR1_11716 [Plesiocystis pacifica SIR-1]
Length = 271
Score = 38.1 bits (87), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 18/30 (60%), Positives = 19/30 (63%)
Query: 56 VCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
VCG DGVTY C CAGV+VA G CE
Sbjct: 163 VCGADGVTYANACVATECAGVEVAHEGACE 192
>gi|340345306|ref|ZP_08668438.1| Blue (Type 1) copper domain protein [Candidatus Nitrosoarchaeum
koreensis MY1]
gi|339520447|gb|EGP94170.1| Blue (Type 1) copper domain protein [Candidatus Nitrosoarchaeum
koreensis MY1]
Length = 569
Score = 38.1 bits (87), Expect = 0.75, Method: Composition-based stats.
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 17/78 (21%)
Query: 10 LIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPELCPINC-FRPDPVCGVDGVTYWCGC 68
L+A+ + F++++ G +PE P+ C + DPVCGVDG+TY C
Sbjct: 14 LVAISLVFISMDPGY---------------IVEPEAKPVACTMQYDPVCGVDGITYGNSC 58
Query: 69 PNALCAGVKVAKLGFCEV 86
A V++A G C V
Sbjct: 59 MLG-AANVQLAYPGECVV 75
>gi|313224749|emb|CBY20540.1| unnamed protein product [Oikopleura dioica]
Length = 1320
Score = 37.4 bits (85), Expect = 1.3, Method: Composition-based stats.
Identities = 24/58 (41%), Positives = 28/58 (48%), Gaps = 15/58 (25%)
Query: 46 CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG------GSAPLPGQ 97
CP+ +PVCG DGVTY N+LC V K CE GSG G P G+
Sbjct: 151 CPLCPAENNPVCGSDGVTY-----NSLC----VMKRTNCEFGSGIELVSSGKCPEKGE 199
>gi|12325237|gb|AAG52567.1|AC010675_15 putative peptide transporter; 37139-33250 [Arabidopsis thaliana]
Length = 644
Score = 36.6 bits (83), Expect = 1.9, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 8/51 (15%)
Query: 43 PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
P+L P +C DP+ CG PN L GV + L F VGSGG P
Sbjct: 171 PQLHPASCNSQDPLS--------CGGPNKLQIGVLLLGLCFLSVGSGGIRP 213
>gi|159464817|ref|XP_001690638.1| secreted protease and protease inhibitor [Chlamydomonas
reinhardtii]
gi|158280138|gb|EDP05897.1| secreted protease and protease inhibitor [Chlamydomonas
reinhardtii]
Length = 951
Score = 36.6 bits (83), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 2/44 (4%)
Query: 53 PDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPG 96
P PVCG DG Y C A C+GV +LG C G G++P PG
Sbjct: 526 PSPVCGSDGNQYTNAC-EAWCSGVLSWRLGTCP-GGNGTSPSPG 567
Score = 34.3 bits (77), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 20/39 (51%), Positives = 21/39 (53%), Gaps = 1/39 (2%)
Query: 55 PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
PVCG DG TY C A CAGVK K G C G + P
Sbjct: 463 PVCGSDGRTYSNPC-EANCAGVKSYKAGECTAQGGVNTP 500
>gi|18409391|ref|NP_564979.1| nitrate transporter 1.7 [Arabidopsis thaliana]
gi|75158815|sp|Q8RX77.1|PTR21_ARATH RecName: Full=Nitrate transporter 1.7
gi|19699104|gb|AAL90918.1| At1g69870/T17F3_10 [Arabidopsis thaliana]
gi|21928025|gb|AAM78041.1| At1g69870/T17F3_10 [Arabidopsis thaliana]
gi|332196872|gb|AEE34993.1| nitrate transporter 1.7 [Arabidopsis thaliana]
Length = 620
Score = 36.6 bits (83), Expect = 2.2, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 8/51 (15%)
Query: 43 PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
P+L P +C DP+ CG PN L GV + L F VGSGG P
Sbjct: 147 PQLHPASCNSQDPLS--------CGGPNKLQIGVLLLGLCFLSVGSGGIRP 189
>gi|256549336|gb|ACU83222.1| serine proteinase inhibitor [Ruditapes philippinarum]
Length = 134
Score = 36.2 bits (82), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 18/32 (56%), Positives = 22/32 (68%), Gaps = 1/32 (3%)
Query: 55 PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEV 86
PVCGVDG+TY C N CA V++A G C+V
Sbjct: 95 PVCGVDGITYGNDC-NGKCANVQLAHTGECDV 125
>gi|297790444|ref|XP_002863113.1| proton-dependent oligopeptide transport family protein [Arabidopsis
lyrata subsp. lyrata]
gi|297308935|gb|EFH39372.1| proton-dependent oligopeptide transport family protein [Arabidopsis
lyrata subsp. lyrata]
Length = 619
Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats.
Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 8/51 (15%)
Query: 43 PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
P+L P +C DP+ CG PN L GV + L F VGSGG P
Sbjct: 146 PQLHPASCNSQDPLS--------CGGPNKLQFGVLLLGLCFLSVGSGGIRP 188
>gi|326431932|gb|EGD77502.1| kunitz protease inhibitor 1 [Salpingoeca sp. ATCC 50818]
Length = 1724
Score = 35.8 bits (81), Expect = 3.4, Method: Composition-based stats.
Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 1/30 (3%)
Query: 56 VCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
VCG DG TY C +A CAGVKVA G C+
Sbjct: 1333 VCGADGKTYPSAC-HARCAGVKVAYRGACQ 1361
>gi|313241335|emb|CBY33610.1| unnamed protein product [Oikopleura dioica]
Length = 1045
Score = 35.8 bits (81), Expect = 3.5, Method: Composition-based stats.
Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 9/44 (20%)
Query: 46 CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG 89
CP+ +PVCG DGVTY N+LC V K CE GSG
Sbjct: 246 CPLCPAENNPVCGSDGVTY-----NSLC----VMKRTNCEFGSG 280
>gi|405951044|gb|EKC18991.1| Serine protease inhibitor dipetalogastin [Crassostrea gigas]
Length = 122
Score = 35.8 bits (81), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 31/71 (43%), Gaps = 3/71 (4%)
Query: 38 ARFTKPELCPINCFRPD---PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPL 94
AR CP NC PD PVCG DGVTY C G+ ++G C+ G A
Sbjct: 42 ARIISNGECPRNCVCPDHYRPVCGSDGVTYPNRCVAGCINGIYDFQVGECQEGGEEKAKG 101
Query: 95 PGQALLLVHIV 105
LL I+
Sbjct: 102 THSNSLLDQIL 112
>gi|307110408|gb|EFN58644.1| hypothetical protein CHLNCDRAFT_140892 [Chlorella variabilis]
Length = 404
Score = 35.8 bits (81), Expect = 3.7, Method: Composition-based stats.
Identities = 24/67 (35%), Positives = 30/67 (44%), Gaps = 4/67 (5%)
Query: 27 VQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEV 86
V++ Q + G CP+N PVCG +G+TY C A AG V G C V
Sbjct: 279 VEAAQGAVGTTGAVGSAVACPLNLM---PVCGANGMTYDNECL-AEAAGTTVVAEGPCTV 334
Query: 87 GSGGSAP 93
G S P
Sbjct: 335 PMGESGP 341
>gi|162455950|ref|YP_001618317.1| protease inhibitor [Sorangium cellulosum So ce56]
gi|161166532|emb|CAN97837.1| putative secreted Protease inhibitor [Sorangium cellulosum So ce56]
Length = 231
Score = 35.4 bits (80), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 22/43 (51%), Positives = 24/43 (55%), Gaps = 4/43 (9%)
Query: 42 KPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
KP+ CP + DPVCG DG TY GC A AGV V G C
Sbjct: 101 KPDACPQ---QYDPVCGCDGRTYSNGCVAAT-AGVSVRSAGEC 139
>gi|313222694|emb|CBY41696.1| unnamed protein product [Oikopleura dioica]
Length = 290
Score = 35.0 bits (79), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 5/81 (6%)
Query: 5 SLANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPELCPINC-FRPDPVCGVDGVT 63
+ NR + +L A G + + + +DGVC + T+ + C C DP+CG DG+
Sbjct: 174 TYGNRDMFILAKCDAQRDGNKNARDWEATDGVC-QLTRVQRCNKACPMINDPICGSDGIR 232
Query: 64 YWCGC---PNALCAGVKVAKL 81
Y C A C+G ++ +L
Sbjct: 233 YANECVMESVAACSGEEITRL 253
>gi|71063300|gb|AAZ22318.1| mantle protein 9 [Pinctada fucata]
Length = 209
Score = 35.0 bits (79), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 17/32 (53%), Positives = 21/32 (65%)
Query: 54 DPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
+P CGVDG TY C +CAGV++A G CE
Sbjct: 39 NPQCGVDGRTYSNPCSARVCAGVEIAYPGRCE 70
>gi|197105034|ref|YP_002130411.1| kazal-type serine protease inhibitor domain [Phenylobacterium
zucineum HLK1]
gi|196478454|gb|ACG77982.1| kazal-type serine protease inhibitor domain [Phenylobacterium
zucineum HLK1]
Length = 132
Score = 35.0 bits (79), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 6/55 (10%)
Query: 31 QESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
Q++ GVC + KPE+C PVCG DG TY C A AGV +A G C+
Sbjct: 82 QDAAGVCTK--KPEICTKEYM---PVCGCDGQTYGNAC-EADRAGVNIASTGACK 130
>gi|242025588|ref|XP_002433206.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212518747|gb|EEB20468.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 864
Score = 35.0 bits (79), Expect = 5.9, Method: Composition-based stats.
Identities = 21/68 (30%), Positives = 31/68 (45%), Gaps = 16/68 (23%)
Query: 42 KPELCPI-----NCFRP--DPVCGVDGVTYW----CGCPNA-----LCAGVKVAKLGFCE 85
+P+ P+ NC R P+CG+DG TY+ GC NA + V A +G+C
Sbjct: 659 QPDFHPVCNTTCNCDREKFSPICGIDGKTYFSACHAGCRNATLVDGMVVEVSTASIGYCN 718
Query: 86 VGSGGSAP 93
+ P
Sbjct: 719 LDCSNFVP 726
>gi|290988085|ref|XP_002676752.1| predicted protein [Naegleria gruberi]
gi|284090356|gb|EFC44008.1| predicted protein [Naegleria gruberi]
Length = 304
Score = 35.0 bits (79), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 4/40 (10%)
Query: 46 CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
CP+ +RP VCGV+G+TY C A CA V + G C+
Sbjct: 247 CPVE-YRP--VCGVNGLTYGSAC-TARCADVSIHYYGVCK 282
>gi|397589373|gb|EJK54637.1| hypothetical protein THAOC_25716, partial [Thalassiosira oceanica]
Length = 175
Score = 34.7 bits (78), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 1/31 (3%)
Query: 54 DPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
D VCG DG TYW C +A +GV+VA G C
Sbjct: 100 DQVCGADGGTYWNDC-HATQSGVEVAYGGRC 129
>gi|82702295|ref|YP_411861.1| proteinase inhibitor I1, Kazal [Nitrosospira multiformis ATCC
25196]
gi|82410360|gb|ABB74469.1| proteinase inhibitor I1, Kazal [Nitrosospira multiformis ATCC
25196]
Length = 263
Score = 34.7 bits (78), Expect = 8.4, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 34/67 (50%), Gaps = 11/67 (16%)
Query: 41 TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGS----GGSAPLP- 95
TKP +CP F+P VCG DG TY C A AGV V G C+ GG A +P
Sbjct: 177 TKPTICP-QIFKP--VCGCDGKTYSNSC-TAAAAGVSVEHEGECKKSEPQACGGIAGIPC 232
Query: 96 --GQALL 100
GQA +
Sbjct: 233 PEGQACM 239
>gi|159476856|ref|XP_001696527.1| hypothetical protein CHLREDRAFT_93612 [Chlamydomonas reinhardtii]
gi|158282752|gb|EDP08504.1| predicted protein [Chlamydomonas reinhardtii]
Length = 87
Score = 34.3 bits (77), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 20/41 (48%), Positives = 21/41 (51%), Gaps = 2/41 (4%)
Query: 46 CPINCFRP-DPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
C C R P CG DG TY C A C GV+VA G CE
Sbjct: 47 CRAGCKRAFKPACGADGRTYASACL-AKCQGVQVAHHGPCE 86
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.329 0.145 0.477
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,813,026,710
Number of Sequences: 23463169
Number of extensions: 68296165
Number of successful extensions: 192023
Number of sequences better than 100.0: 151
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 104
Number of HSP's that attempted gapping in prelim test: 191601
Number of HSP's gapped (non-prelim): 521
length of query: 119
length of database: 8,064,228,071
effective HSP length: 86
effective length of query: 33
effective length of database: 6,046,395,537
effective search space: 199531052721
effective search space used: 199531052721
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 69 (31.2 bits)