BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 033423
         (119 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|224130730|ref|XP_002320913.1| predicted protein [Populus trichocarpa]
 gi|222861686|gb|EEE99228.1| predicted protein [Populus trichocarpa]
          Length = 95

 Score =  131 bits (330), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 62/95 (65%), Positives = 69/95 (72%)

Query: 25  QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           Q V +       CA    P  CPINCFR DPVCG DGVTYWCGC +A+C+G +VAKLG C
Sbjct: 1   QEVTNKDGKGEACAGLKAPASCPINCFRADPVCGFDGVTYWCGCADAMCSGTRVAKLGAC 60

Query: 85  EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           EVGSGGSA LPGQALLL+HIVWL L+  SLL G F
Sbjct: 61  EVGSGGSASLPGQALLLIHIVWLILIGFSLLFGFF 95


>gi|356553023|ref|XP_003544858.1| PREDICTED: uncharacterized protein LOC100777832 [Glycine max]
          Length = 124

 Score =  131 bits (329), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 62/87 (71%), Positives = 70/87 (80%), Gaps = 1/87 (1%)

Query: 33  SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
           SD +C + T P  CP  CFR DPVCG DGVTYWCGC  A CAGV+VAKLGFCEVG+GGSA
Sbjct: 39  SDSLCGK-TTPSSCPAKCFRADPVCGADGVTYWCGCAEAACAGVEVAKLGFCEVGNGGSA 97

Query: 93  PLPGQALLLVHIVWLFLLALSLLLGVF 119
           P+PGQALLLVHIVWL +L  S+L G+F
Sbjct: 98  PIPGQALLLVHIVWLIVLGFSVLFGLF 124


>gi|449476421|ref|XP_004154732.1| PREDICTED: uncharacterized protein LOC101228097 [Cucumis sativus]
          Length = 130

 Score =  130 bits (327), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 57/79 (72%), Positives = 69/79 (87%)

Query: 41  TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
           + P  CP+ CFR DPVCGVDGVTYWCGC +ALC+GVKVAK+GFCEVG+GGSAP+P QALL
Sbjct: 52  SAPSYCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKVAKMGFCEVGNGGSAPIPAQALL 111

Query: 101 LVHIVWLFLLALSLLLGVF 119
           LVHI+WL +L +S+L G+F
Sbjct: 112 LVHILWLIILGVSVLFGLF 130


>gi|449445981|ref|XP_004140750.1| PREDICTED: uncharacterized protein LOC101210178 [Cucumis sativus]
 gi|449522135|ref|XP_004168083.1| PREDICTED: uncharacterized LOC101210178 [Cucumis sativus]
          Length = 130

 Score =  130 bits (327), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 58/77 (75%), Positives = 69/77 (89%)

Query: 43  PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLV 102
           P  CP+ CFR DPVCGVDGVTYWCGC +ALC+GVKVAK+GFCEVG+GGSAP+PGQALLLV
Sbjct: 54  PSSCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKVAKMGFCEVGNGGSAPIPGQALLLV 113

Query: 103 HIVWLFLLALSLLLGVF 119
           HI+WL +L +S+L G+F
Sbjct: 114 HILWLIILGVSVLFGLF 130


>gi|357464293|ref|XP_003602428.1| hypothetical protein MTR_3g093240 [Medicago truncatula]
 gi|355491476|gb|AES72679.1| hypothetical protein MTR_3g093240 [Medicago truncatula]
          Length = 133

 Score =  130 bits (326), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 60/92 (65%), Positives = 67/92 (72%)

Query: 28  QSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVG 87
           + T E   VCA       CP+ CFR DPVCGVDGVTYWCGC  A CAGVKV K+GFCEVG
Sbjct: 42  KVTAEGKNVCAGAVASSWCPVKCFRTDPVCGVDGVTYWCGCAEAACAGVKVGKMGFCEVG 101

Query: 88  SGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           SGGSAPL  QA LL+HIVWL +LA S+  G+F
Sbjct: 102 SGGSAPLSAQAFLLLHIVWLIVLAFSVFFGLF 133


>gi|356500817|ref|XP_003519227.1| PREDICTED: uncharacterized protein LOC100780930 [Glycine max]
          Length = 125

 Score =  125 bits (315), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 60/87 (68%), Positives = 69/87 (79%), Gaps = 1/87 (1%)

Query: 33  SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
           S+ +C + T P  CP  CFR DPVCG DGVTYWCGC  A CAGV+VAK GFCEVG+GGSA
Sbjct: 40  SESLCGK-TMPLSCPAKCFRADPVCGADGVTYWCGCAEAACAGVEVAKFGFCEVGNGGSA 98

Query: 93  PLPGQALLLVHIVWLFLLALSLLLGVF 119
           P+PGQALLLVHIVWL +L  S+L G+F
Sbjct: 99  PIPGQALLLVHIVWLIVLGFSVLFGLF 125


>gi|115440619|ref|NP_001044589.1| Os01g0811200 [Oryza sativa Japonica Group]
 gi|18844762|dbj|BAB85233.1| serine protease inhibitor-like protein [Oryza sativa Japonica
           Group]
 gi|20160706|dbj|BAB89649.1| serine protease inhibitor-like protein [Oryza sativa Japonica
           Group]
 gi|113534120|dbj|BAF06503.1| Os01g0811200 [Oryza sativa Japonica Group]
          Length = 128

 Score =  120 bits (302), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 54/77 (70%), Positives = 66/77 (85%), Gaps = 1/77 (1%)

Query: 41  TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
           ++ +LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALL
Sbjct: 51  SEAQLCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALL 109

Query: 101 LVHIVWLFLLALSLLLG 117
           LVHIVWLF+L  ++LLG
Sbjct: 110 LVHIVWLFVLGAAVLLG 126


>gi|413952150|gb|AFW84799.1| serine-type endopeptidase inhibitor [Zea mays]
          Length = 131

 Score =  120 bits (301), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 55/74 (74%), Positives = 64/74 (86%), Gaps = 1/74 (1%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG-GSAPLPGQALLLVH 103
           LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+G GSAP+ GQALLLVH
Sbjct: 56  LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGAGSGSAPVSGQALLLVH 115

Query: 104 IVWLFLLALSLLLG 117
           IVWLF+L  ++LLG
Sbjct: 116 IVWLFVLGAAVLLG 129


>gi|449438983|ref|XP_004137267.1| PREDICTED: uncharacterized protein LOC101220346 [Cucumis sativus]
          Length = 129

 Score =  120 bits (301), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 55/79 (69%), Positives = 68/79 (86%), Gaps = 1/79 (1%)

Query: 41  TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
           + P  CP+ CFR DPVCGVDGVTYWCGC +ALC+GVKVAK+GFCEVG+GGSAP+P QALL
Sbjct: 52  SAPSYCPVKCFRTDPVCGVDGVTYWCGCADALCSGVKVAKMGFCEVGNGGSAPIPAQALL 111

Query: 101 LVHIVWLFLLALSLLLGVF 119
           LVHI+ + +L +S+L G+F
Sbjct: 112 LVHILPI-ILGVSVLFGLF 129


>gi|226505326|ref|NP_001149593.1| serine-type endopeptidase inhibitor precursor [Zea mays]
 gi|195628284|gb|ACG35972.1| serine-type endopeptidase inhibitor [Zea mays]
          Length = 131

 Score =  120 bits (300), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 55/74 (74%), Positives = 64/74 (86%), Gaps = 1/74 (1%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG-GSAPLPGQALLLVH 103
           LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+G GSAP+ GQALLLVH
Sbjct: 56  LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGAGSGSAPVSGQALLLVH 115

Query: 104 IVWLFLLALSLLLG 117
           IVWLF+L  ++LLG
Sbjct: 116 IVWLFVLGAAVLLG 129


>gi|414880042|tpg|DAA57173.1| TPA: serine-type endopeptidase inhibitor [Zea mays]
          Length = 227

 Score =  120 bits (300), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 54/73 (73%), Positives = 63/73 (86%), Gaps = 1/73 (1%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
           LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 154 LCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALLLVHI 212

Query: 105 VWLFLLALSLLLG 117
           VWLF+L  ++LLG
Sbjct: 213 VWLFVLGAAVLLG 225


>gi|125528114|gb|EAY76228.1| hypothetical protein OsI_04164 [Oryza sativa Indica Group]
          Length = 132

 Score =  119 bits (299), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 54/77 (70%), Positives = 66/77 (85%), Gaps = 1/77 (1%)

Query: 41  TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALL 100
           ++ +LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALL
Sbjct: 55  SEAQLCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALL 113

Query: 101 LVHIVWLFLLALSLLLG 117
           LVHIVWLF+L  ++LLG
Sbjct: 114 LVHIVWLFVLGAAVLLG 130


>gi|224064390|ref|XP_002301452.1| predicted protein [Populus trichocarpa]
 gi|222843178|gb|EEE80725.1| predicted protein [Populus trichocarpa]
          Length = 95

 Score =  118 bits (295), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 64/95 (67%), Positives = 69/95 (72%), Gaps = 3/95 (3%)

Query: 28  QSTQESDG---VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           Q     DG    CA    P  CPINCFR DPVCGVDGVTYWCGC +ALC+G +V KLG C
Sbjct: 1   QEVTREDGKGDACAGLKAPASCPINCFRADPVCGVDGVTYWCGCADALCSGTRVDKLGAC 60

Query: 85  EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           EVGSGGS+ LPGQALLL+HIVWL LL  SLL G F
Sbjct: 61  EVGSGGSSSLPGQALLLIHIVWLILLGFSLLFGFF 95


>gi|357125538|ref|XP_003564450.1| PREDICTED: uncharacterized protein LOC100825392 [Brachypodium
           distachyon]
          Length = 125

 Score =  117 bits (293), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 54/73 (73%), Positives = 64/73 (87%), Gaps = 1/73 (1%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
           LCP+ CFRPDPVCG DGVTYWCGCP A+CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 52  LCPVRCFRPDPVCGADGVTYWCGCPEAVCAGARVARRGYCEVGA-GSAPVSGQALLLVHI 110

Query: 105 VWLFLLALSLLLG 117
           VWLF+L  ++LLG
Sbjct: 111 VWLFVLGAAVLLG 123


>gi|226510470|ref|NP_001149605.1| serine-type endopeptidase inhibitor precursor [Zea mays]
 gi|195628428|gb|ACG36044.1| serine-type endopeptidase inhibitor [Zea mays]
          Length = 129

 Score =  117 bits (293), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 54/73 (73%), Positives = 63/73 (86%), Gaps = 1/73 (1%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
           LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 56  LCPVRCFRPDPVCGADGVTYWCGCPEAACAGARVARRGYCEVGA-GSAPVSGQALLLVHI 114

Query: 105 VWLFLLALSLLLG 117
           VWLF+L  ++LLG
Sbjct: 115 VWLFVLGAAVLLG 127


>gi|242054737|ref|XP_002456514.1| hypothetical protein SORBIDRAFT_03g037650 [Sorghum bicolor]
 gi|241928489|gb|EES01634.1| hypothetical protein SORBIDRAFT_03g037650 [Sorghum bicolor]
          Length = 131

 Score =  117 bits (292), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 54/73 (73%), Positives = 63/73 (86%), Gaps = 1/73 (1%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHI 104
           LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVG+ GSAP+ GQALLLVHI
Sbjct: 58  LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGA-GSAPVSGQALLLVHI 116

Query: 105 VWLFLLALSLLLG 117
           VWLF+L  ++LLG
Sbjct: 117 VWLFVLGAAVLLG 129


>gi|18411677|ref|NP_567214.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
           thaliana]
 gi|332656644|gb|AEE82044.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
           thaliana]
          Length = 144

 Score =  116 bits (290), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 51/95 (53%), Positives = 64/95 (67%)

Query: 25  QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           + +   +     C    KP  CP+ CFRPDPVCG D VTYWCGC +ALC GV+V K G C
Sbjct: 50  EKINGEKNRGEFCEGIAKPASCPVQCFRPDPVCGEDSVTYWCGCADALCHGVRVVKQGAC 109

Query: 85  EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           +VG+G    +PGQALLL+HIVW+ LL  S+L G+F
Sbjct: 110 DVGNGVGLSVPGQALLLIHIVWMMLLGFSILFGLF 144


>gi|21555759|gb|AAM63928.1| unknown [Arabidopsis thaliana]
          Length = 135

 Score =  115 bits (289), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 51/95 (53%), Positives = 64/95 (67%)

Query: 25  QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           + +   +     C    KP  CP+ CFRPDPVCG D VTYWCGC +ALC GV+V K G C
Sbjct: 41  EKINGEKNRGEFCEGIAKPASCPVQCFRPDPVCGEDSVTYWCGCADALCHGVRVVKQGAC 100

Query: 85  EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           +VG+G    +PGQALLL+HIVW+ LL  S+L G+F
Sbjct: 101 DVGNGVGLSVPGQALLLIHIVWMMLLGFSILFGLF 135


>gi|297810001|ref|XP_002872884.1| hypothetical protein ARALYDRAFT_490419 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318721|gb|EFH49143.1| hypothetical protein ARALYDRAFT_490419 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 137

 Score =  115 bits (289), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 51/95 (53%), Positives = 64/95 (67%)

Query: 25  QLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           + +   +     C    KP  CP+ CFRPDPVCG D VTYWCGC +ALC GV+V K G C
Sbjct: 43  EKIDGEKNRGEFCEGIAKPASCPVQCFRPDPVCGEDSVTYWCGCADALCHGVRVVKQGAC 102

Query: 85  EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           +VG+G    +PGQALLL+HIVW+ LL  S+L G+F
Sbjct: 103 DVGNGVGLSVPGQALLLIHIVWMMLLGFSILFGLF 137


>gi|255548137|ref|XP_002515125.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
 gi|223545605|gb|EEF47109.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
          Length = 130

 Score =  115 bits (287), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 55/87 (63%), Positives = 66/87 (75%), Gaps = 1/87 (1%)

Query: 23  GLQLVQSTQESDG-VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKL 81
           G+ + Q T + +G VC   T P  CPINCFR DPVCGVDG TYWCGC +ALC+G  VAK+
Sbjct: 33  GMLIEQITSKGNGDVCKGVTAPASCPINCFRADPVCGVDGRTYWCGCDDALCSGTTVAKI 92

Query: 82  GFCEVGSGGSAPLPGQALLLVHIVWLF 108
           G C++G+GGSA LP QALLLVHI+WL 
Sbjct: 93  GACDMGNGGSASLPRQALLLVHILWLI 119


>gi|356507538|ref|XP_003522521.1| PREDICTED: uncharacterized protein LOC100797240 [Glycine max]
          Length = 131

 Score =  114 bits (285), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 53/88 (60%), Positives = 62/88 (70%), Gaps = 1/88 (1%)

Query: 32  ESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGS 91
           E   +C+    P  CP+ CFR DPVC VDGVTYWCGC  A  A  ++AKLGFCEVG+GGS
Sbjct: 45  EGKNLCSA-AAPSSCPVKCFRTDPVCSVDGVTYWCGCSEAAYASAQIAKLGFCEVGNGGS 103

Query: 92  APLPGQALLLVHIVWLFLLALSLLLGVF 119
             L GQALLLVHIVWL +L  S+  G+F
Sbjct: 104 VTLSGQALLLVHIVWLIVLGFSVFFGLF 131


>gi|224132550|ref|XP_002321347.1| predicted protein [Populus trichocarpa]
 gi|222868343|gb|EEF05474.1| predicted protein [Populus trichocarpa]
          Length = 132

 Score =  112 bits (280), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 54/85 (63%), Positives = 63/85 (74%), Gaps = 3/85 (3%)

Query: 35  GVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPL 94
           G  +R +    CP+ CFRPDPVCGVDGVTYWCGC +A CAG KV K GFCEVG+ G+A  
Sbjct: 51  GEWSRRSPVSSCPVKCFRPDPVCGVDGVTYWCGCDDARCAGTKVTKKGFCEVGNNGAA-- 108

Query: 95  PGQALLLVHIVWLFLLALSLLLGVF 119
             QALLLVHIVWL +L  S+L G+F
Sbjct: 109 -AQALLLVHIVWLIVLGFSILFGLF 132


>gi|297821082|ref|XP_002878424.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
           lyrata subsp. lyrata]
 gi|297324262|gb|EFH54683.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 117

 Score =  111 bits (277), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 53/95 (55%), Positives = 64/95 (67%), Gaps = 2/95 (2%)

Query: 14  LIFFLALNIGLQLVQSTQESDG--VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNA 71
            +F +   IGLQ      ++ G  VC R +    CPINCFR DPVCG DGVTYWCGCP+A
Sbjct: 10  FLFLVLCLIGLQAADDFLDNSGGDVCPRVSDSGGCPINCFRADPVCGADGVTYWCGCPDA 69

Query: 72  LCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHIVW 106
            C G +V K G C+ G+ GSA +PGQALLL+HIVW
Sbjct: 70  ACHGARVVKRGACDTGNAGSASVPGQALLLIHIVW 104


>gi|225455171|ref|XP_002268733.1| PREDICTED: uncharacterized protein LOC100253810 [Vitis vinifera]
          Length = 128

 Score =  109 bits (272), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 55/89 (61%), Positives = 62/89 (69%), Gaps = 5/89 (5%)

Query: 32  ESDG-VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGG 90
           ESDG +C        CP+ CFRPDPVCGVDGVTYWCGC  A CAG  VA  G CEVG+G 
Sbjct: 44  ESDGGLCGGSEVAASCPVKCFRPDPVCGVDGVTYWCGCTEARCAGTAVAHFGICEVGNGS 103

Query: 91  SAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           +    GQALLLVHIVWL +L  S+L G+F
Sbjct: 104 A----GQALLLVHIVWLMVLGFSVLFGLF 128


>gi|116784747|gb|ABK23457.1| unknown [Picea sitchensis]
          Length = 131

 Score =  107 bits (268), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 56/83 (67%), Positives = 67/83 (80%)

Query: 36  VCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLP 95
           VC+   +   CP++CFRPDPVCGVDG+TYWCGC +A CAGV VAK+GFCEVG+GGS  L 
Sbjct: 48  VCSEKNESASCPVSCFRPDPVCGVDGITYWCGCTDAQCAGVDVAKIGFCEVGNGGSGLLS 107

Query: 96  GQALLLVHIVWLFLLALSLLLGV 118
           G+ALLLVHIVWL LL   +L+GV
Sbjct: 108 GEALLLVHIVWLILLGFIVLIGV 130


>gi|357490411|ref|XP_003615493.1| hypothetical protein MTR_5g068760 [Medicago truncatula]
 gi|355516828|gb|AES98451.1| hypothetical protein MTR_5g068760 [Medicago truncatula]
 gi|388491276|gb|AFK33704.1| unknown [Medicago truncatula]
 gi|388497980|gb|AFK37056.1| unknown [Medicago truncatula]
          Length = 120

 Score =  107 bits (266), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 61/87 (70%), Positives = 65/87 (74%), Gaps = 1/87 (1%)

Query: 33  SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
           S  VC+  T P  CP  CFR DPVCG DGVTYWCGC  A CAG KVAKLGFCEVG+GGSA
Sbjct: 35  SQNVCS-VTTPSSCPAKCFRTDPVCGADGVTYWCGCAEAACAGAKVAKLGFCEVGNGGSA 93

Query: 93  PLPGQALLLVHIVWLFLLALSLLLGVF 119
             PGQALLLVHIVWL +L  S+L G F
Sbjct: 94  TFPGQALLLVHIVWLIVLGFSVLFGFF 120


>gi|21554106|gb|AAM63186.1| unknown [Arabidopsis thaliana]
          Length = 117

 Score =  106 bits (265), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 3/99 (3%)

Query: 11  IAVLIFFLALN-IGLQLVQS--TQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCG 67
           +++   FL L  IGLQ       +    VC R      C INCFR DPVCG DGVTYWCG
Sbjct: 6   LSIRFLFLVLCLIGLQAADDFPDKSRGDVCPRVKDRGGCTINCFRADPVCGTDGVTYWCG 65

Query: 68  CPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHIVW 106
           CP+A C G +V K G C+ G+ GSA +PGQALLL+HIVW
Sbjct: 66  CPDAACHGTRVVKKGACDTGNAGSASVPGQALLLIHIVW 104


>gi|18412211|ref|NP_567123.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
           thaliana]
 gi|6899896|emb|CAB71905.1| hypothetical protein [Arabidopsis thaliana]
 gi|17473699|gb|AAL38306.1| unknown protein [Arabidopsis thaliana]
 gi|21386987|gb|AAM47897.1| unknown protein [Arabidopsis thaliana]
 gi|332646768|gb|AEE80289.1| serine protease inhibitor, Kazal-type family protein [Arabidopsis
           thaliana]
          Length = 117

 Score =  106 bits (264), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 53/99 (53%), Positives = 63/99 (63%), Gaps = 3/99 (3%)

Query: 11  IAVLIFFLALN-IGLQLVQS--TQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCG 67
           +++   FL L  IGLQ       +    VC R      C INCFR DPVCG DGVTYWCG
Sbjct: 6   LSIRFLFLVLCLIGLQAADDFPDKSRGDVCPRVKDRGGCTINCFRADPVCGTDGVTYWCG 65

Query: 68  CPNALCAGVKVAKLGFCEVGSGGSAPLPGQALLLVHIVW 106
           CP+A C G +V K G C+ G+ GSA +PGQALLL+HIVW
Sbjct: 66  CPDAACHGARVVKKGACDTGNAGSASVPGQALLLIHIVW 104


>gi|255557162|ref|XP_002519612.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
 gi|223541202|gb|EEF42757.1| serine-type endopeptidase inhibitor, putative [Ricinus communis]
          Length = 134

 Score =  103 bits (257), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 53/100 (53%), Positives = 70/100 (70%), Gaps = 3/100 (3%)

Query: 20  LNIGLQLVQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVA 79
           + +   ++ +  ++   C  + +P  CP+NCFR DPVCGVDGVTYWCGC +A CAG KVA
Sbjct: 38  IRLPTHIIDNQDDNVDPCGEYGRPVSCPVNCFRTDPVCGVDGVTYWCGCRDAWCAGTKVA 97

Query: 80  KLGFCEVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           K GFCEVG+ G+A    QALLL+HIVWL +L   +L G+F
Sbjct: 98  KKGFCEVGNNGAA---AQALLLLHIVWLIVLGFCVLFGIF 134


>gi|414880449|tpg|DAA57580.1| TPA: hypothetical protein ZEAMMB73_385931 [Zea mays]
          Length = 384

 Score = 90.5 bits (223), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 35/45 (77%), Positives = 39/45 (86%)

Query: 45  LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG 89
           LCP+ CFRPDPVCG DGVTYWCGCP A CAG +VA+ G+CEVGSG
Sbjct: 279 LCPVRCFRPDPVCGADGVTYWCGCPEATCAGARVARRGYCEVGSG 323


>gi|168026830|ref|XP_001765934.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162682840|gb|EDQ69255.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 167

 Score = 85.9 bits (211), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 42/84 (50%), Positives = 55/84 (65%), Gaps = 4/84 (4%)

Query: 40  FTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPG--- 96
             K  LC +NCFR DPVCG DGVTYWCG   A CA V+VA  G+C++ +GG+    G   
Sbjct: 84  IEKEPLCTVNCFRADPVCGSDGVTYWCGVAEAKCAKVEVAHDGYCDIWNGGTKANLGLHA 143

Query: 97  -QALLLVHIVWLFLLALSLLLGVF 119
            Q+L LVH+VWL L  L +++G+ 
Sbjct: 144 AQSLQLVHMVWLVLAGLLIVIGLL 167


>gi|125572387|gb|EAZ13902.1| hypothetical protein OsJ_03827 [Oryza sativa Japonica Group]
          Length = 129

 Score = 78.6 bits (192), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 52/78 (66%), Gaps = 2/78 (2%)

Query: 41  TKPELCPINCFRPDPVCGV-DGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPGQAL 99
           ++ +LCP+ CFR         G     G P   CAG +VA+ G+CEVG+G SAP+ GQAL
Sbjct: 51  SEAQLCPVRCFRAGTRSAAPTGSRTGAGAPRPACAGARVARRGYCEVGAG-SAPVSGQAL 109

Query: 100 LLVHIVWLFLLALSLLLG 117
           LLVHIVWLF+L  ++LLG
Sbjct: 110 LLVHIVWLFVLGAAVLLG 127


>gi|255548139|ref|XP_002515126.1| hypothetical protein RCOM_1341480 [Ricinus communis]
 gi|223545606|gb|EEF47110.1| hypothetical protein RCOM_1341480 [Ricinus communis]
          Length = 83

 Score = 74.7 bits (182), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 39/84 (46%), Positives = 51/84 (60%), Gaps = 4/84 (4%)

Query: 3  TSSLANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPELC--PINCFRPDPVCGVD 60
          T+S   R +  +I  +A  +   L+ +  ++  +C     P  C  PINCFRPDPVCG +
Sbjct: 2  TTSFKKRTLLCMIVVVATCV--LLLTNGVQAQNLCPGSEPPPSCAGPINCFRPDPVCGAN 59

Query: 61 GVTYWCGCPNALCAGVKVAKLGFC 84
          GVTYWCGCP+A CAGV V KL  C
Sbjct: 60 GVTYWCGCPDAACAGVPVVKLEAC 83


>gi|255548141|ref|XP_002515127.1| hypothetical protein RCOM_1341590 [Ricinus communis]
 gi|223545607|gb|EEF47111.1| hypothetical protein RCOM_1341590 [Ricinus communis]
          Length = 81

 Score = 73.6 bits (179), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 39/77 (50%), Positives = 48/77 (62%), Gaps = 1/77 (1%)

Query: 9  RLIAVLIFFLALNIGLQLVQSTQESDGVC-ARFTKPELCPINCFRPDPVCGVDGVTYWCG 67
          + IA L   L + I    V  +Q +  +C    + PE CPI C  PDPVCG +GVTYWCG
Sbjct: 5  KNIAPLCMMLLVFIVCVSVAQSQITINLCPGPMSPPEGCPIACLVPDPVCGANGVTYWCG 64

Query: 68 CPNALCAGVKVAKLGFC 84
          CP+ALCAGV+V K G C
Sbjct: 65 CPDALCAGVRVVKFGEC 81


>gi|168024312|ref|XP_001764680.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162683974|gb|EDQ70379.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 117

 Score = 70.5 bits (171), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 39/102 (38%), Positives = 55/102 (53%), Gaps = 6/102 (5%)

Query: 24  LQLVQSTQESDGVCARFTKP------ELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVK 77
           ++ V S   +DG      +P      + CP+ C R DPVCGVD VTYWCG  +A CAGV+
Sbjct: 10  IRSVVSKAAADGANNVLIEPCILQTTKSCPVKCVRTDPVCGVDKVTYWCGAADAKCAGVE 69

Query: 78  VAKLGFCEVGSGGSAPLPGQALLLVHIVWLFLLALSLLLGVF 119
           VA  G C +   G+A     AL  +H + L  +   ++ G+F
Sbjct: 70  VAHDGSCNLWELGTATSSTTALYALHSLQLNRMFCLVIAGLF 111


>gi|168050727|ref|XP_001777809.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162670785|gb|EDQ57347.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 190

 Score = 69.7 bits (169), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 47/81 (58%), Gaps = 7/81 (8%)

Query: 33  SDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG--- 89
           S G C R T    CP+ CFR  PVCG D VTY CG  +A CAGV+VA  GFC +  G   
Sbjct: 87  SKGPCIRQTTMS-CPVKCFRSYPVCGTDKVTYRCGAADANCAGVEVAYDGFCNLWEGDRN 145

Query: 90  ---GSAPLPGQALLLVHIVWL 107
               +A    Q+L LVH++WL
Sbjct: 146 VGSSTALYAVQSLQLVHMLWL 166


>gi|302785325|ref|XP_002974434.1| hypothetical protein SELMODRAFT_414607 [Selaginella
          moellendorffii]
 gi|300158032|gb|EFJ24656.1| hypothetical protein SELMODRAFT_414607 [Selaginella
          moellendorffii]
          Length = 128

 Score = 65.5 bits (158), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 25/46 (54%), Positives = 35/46 (76%)

Query: 46 CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGS 91
          CP++C R DPVCG +GVTYWCG  +A C GV+++  G+C  G+GG+
Sbjct: 50 CPVSCLRADPVCGANGVTYWCGSRDAACDGVEISHTGYCSPGNGGA 95


>gi|224130742|ref|XP_002320916.1| predicted protein [Populus trichocarpa]
 gi|222861689|gb|EEE99231.1| predicted protein [Populus trichocarpa]
          Length = 81

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 38/83 (45%), Positives = 47/83 (56%), Gaps = 6/83 (7%)

Query: 6  LANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPEL----CPINCFRPDPVCGVDG 61
          +A   I   +  + L  GL L ++  +S  VCA   +P+     C INCF PDPVCG DG
Sbjct: 1  MATSKIVAPLCLMVLVFGLCLPKA--QSQDVCAGVERPDPETIPCTINCFVPDPVCGTDG 58

Query: 62 VTYWCGCPNALCAGVKVAKLGFC 84
          VTY CGC +A C GV V K G C
Sbjct: 59 VTYSCGCLDAFCHGVDVVKEGEC 81


>gi|224130738|ref|XP_002320915.1| predicted protein [Populus trichocarpa]
 gi|222861688|gb|EEE99230.1| predicted protein [Populus trichocarpa]
          Length = 83

 Score = 65.1 bits (157), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 42/89 (47%), Positives = 54/89 (60%), Gaps = 11/89 (12%)

Query: 1  MRTSSLANRL-IAVLIFFLALNIGLQLVQSTQESDGVCARFTKPEL----CPINCFRPDP 55
          M TS +   L + VL+F L+L+    +V+S  +S GVCA   +P+     C INC   DP
Sbjct: 1  MATSKIVAPLCLMVLVFCLSLS----MVKS--QSYGVCAGAARPDPETIPCTINCLVADP 54

Query: 56 VCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
          VCG DGVTY CGC +A C GV+V K G C
Sbjct: 55 VCGTDGVTYTCGCYDAFCHGVEVVKKGEC 83


>gi|224130734|ref|XP_002320914.1| predicted protein [Populus trichocarpa]
 gi|222861687|gb|EEE99229.1| predicted protein [Populus trichocarpa]
          Length = 80

 Score = 58.9 bits (141), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 6  LANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPEL--CPINCFRPDPVCGVDGVT 63
          +A   I   +  + L  GL L     +S GVC  F  PE   C + C  PD VCG DGVT
Sbjct: 1  MATSKILAPLCLMVLVFGLCLSMVESQSYGVCEGF-DPEAPRCAVRCSVPDYVCGTDGVT 59

Query: 64 YWCGCPNALCAGVKVAKLGFC 84
          Y CGC +A C GV V K G C
Sbjct: 60 YTCGCKDAFCNGVDVVKKGKC 80


>gi|255548143|ref|XP_002515128.1| conserved hypothetical protein [Ricinus communis]
 gi|223545608|gb|EEF47112.1| conserved hypothetical protein [Ricinus communis]
          Length = 75

 Score = 55.1 bits (131), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 43/75 (57%), Gaps = 4/75 (5%)

Query: 14 LIFFLALNIGLQLVQSTQESDGVC--ARFTKP--ELCPINCFRPDPVCGVDGVTYWCGCP 69
          ++  + L + L +V+S  +   +C  A+   P  +   I C R DPVCGV+GVTY CGCP
Sbjct: 1  MMMVVVLGVCLTMVESKVQVPNLCDLAKSASPGKDCSAIRCLRYDPVCGVNGVTYGCGCP 60

Query: 70 NALCAGVKVAKLGFC 84
           A    V+V KLG C
Sbjct: 61 EANFYAVRVGKLGAC 75


>gi|407462333|ref|YP_006773650.1| hypothetical protein NKOR_04090 [Candidatus Nitrosopumilus
           koreensis AR1]
 gi|407045955|gb|AFS80708.1| hypothetical protein NKOR_04090 [Candidatus Nitrosopumilus
           koreensis AR1]
          Length = 250

 Score = 51.6 bits (122), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/51 (50%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 42  KPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
           +PE C I C   DPVCGVDG+TY CG  +A C GVKV   G C     G+ 
Sbjct: 201 EPE-CSIQCLVYDPVCGVDGITYACGIEDAACHGVKVKHDGECSDSKLGTE 250


>gi|407465325|ref|YP_006776207.1| proteinase inhibitor I1 Kazal [Candidatus Nitrosopumilus sp. AR2]
 gi|407048513|gb|AFS83265.1| proteinase inhibitor I1 Kazal [Candidatus Nitrosopumilus sp. AR2]
          Length = 249

 Score = 48.9 bits (115), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 27/56 (48%), Gaps = 2/56 (3%)

Query: 35  GVCARFTKPE--LCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGS 88
            +   F + E   C I C   DPVCG D +TY CG  +A C  VKV   G C   S
Sbjct: 114 DIREEFNRNENKECSIQCLVYDPVCGEDNITYACGVEDATCHNVKVKYSGECNASS 169


>gi|168057730|ref|XP_001780866.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667722|gb|EDQ54345.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 169

 Score = 44.7 bits (104), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/59 (40%), Positives = 34/59 (57%), Gaps = 6/59 (10%)

Query: 65  WCGCPNALCAGVKVAKLGFCEVGS------GGSAPLPGQALLLVHIVWLFLLALSLLLG 117
           WCG  +A CAGV+VA  G+C V          +A    Q+L LVH++WL +  L ++LG
Sbjct: 109 WCGAADAKCAGVEVAHDGYCNVWELDANVGSTTAVRAAQSLQLVHMLWLVVAGLLIVLG 167


>gi|168044603|ref|XP_001774770.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673925|gb|EDQ60441.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 196

 Score = 38.5 bits (88), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 25/59 (42%), Positives = 33/59 (55%), Gaps = 8/59 (13%)

Query: 66  CGCPNALCAGVKVAKLGFC-------EVGSGGSAPLPGQALLLVHIVWLFLLALSLLLG 117
           CG  +A CAGV VA  G C        VGS  +A    Q+L LVH++WL +  L ++LG
Sbjct: 137 CGAADAKCAGVGVAHDGHCNLWELNANVGS-TTAVRAAQSLQLVHMLWLVVAGLLIVLG 194


>gi|302843587|ref|XP_002953335.1| hypothetical protein VOLCADRAFT_105857 [Volvox carteri f.
           nagariensis]
 gi|300261432|gb|EFJ45645.1| hypothetical protein VOLCADRAFT_105857 [Volvox carteri f.
           nagariensis]
          Length = 964

 Score = 38.5 bits (88), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 24/47 (51%), Gaps = 3/47 (6%)

Query: 49  NCFRPD---PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSA 92
            C  PD   PVCG +G TY   C    CAGV VA LG C V   G A
Sbjct: 791 QCNCPDDYQPVCGSNGKTYDNACKAVFCAGVSVAYLGECSVDENGCA 837


>gi|149924837|ref|ZP_01913176.1| hypothetical protein PPSIR1_11716 [Plesiocystis pacifica SIR-1]
 gi|149814297|gb|EDM73900.1| hypothetical protein PPSIR1_11716 [Plesiocystis pacifica SIR-1]
          Length = 271

 Score = 38.1 bits (87), Expect = 0.67,   Method: Compositional matrix adjust.
 Identities = 18/30 (60%), Positives = 19/30 (63%)

Query: 56  VCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
           VCG DGVTY   C    CAGV+VA  G CE
Sbjct: 163 VCGADGVTYANACVATECAGVEVAHEGACE 192


>gi|340345306|ref|ZP_08668438.1| Blue (Type 1) copper domain protein [Candidatus Nitrosoarchaeum
          koreensis MY1]
 gi|339520447|gb|EGP94170.1| Blue (Type 1) copper domain protein [Candidatus Nitrosoarchaeum
          koreensis MY1]
          Length = 569

 Score = 38.1 bits (87), Expect = 0.75,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 17/78 (21%)

Query: 10 LIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPELCPINC-FRPDPVCGVDGVTYWCGC 68
          L+A+ + F++++ G                  +PE  P+ C  + DPVCGVDG+TY   C
Sbjct: 14 LVAISLVFISMDPGY---------------IVEPEAKPVACTMQYDPVCGVDGITYGNSC 58

Query: 69 PNALCAGVKVAKLGFCEV 86
               A V++A  G C V
Sbjct: 59 MLG-AANVQLAYPGECVV 75


>gi|313224749|emb|CBY20540.1| unnamed protein product [Oikopleura dioica]
          Length = 1320

 Score = 37.4 bits (85), Expect = 1.3,   Method: Composition-based stats.
 Identities = 24/58 (41%), Positives = 28/58 (48%), Gaps = 15/58 (25%)

Query: 46  CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG------GSAPLPGQ 97
           CP+     +PVCG DGVTY     N+LC    V K   CE GSG      G  P  G+
Sbjct: 151 CPLCPAENNPVCGSDGVTY-----NSLC----VMKRTNCEFGSGIELVSSGKCPEKGE 199


>gi|12325237|gb|AAG52567.1|AC010675_15 putative peptide transporter; 37139-33250 [Arabidopsis thaliana]
          Length = 644

 Score = 36.6 bits (83), Expect = 1.9,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 8/51 (15%)

Query: 43  PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
           P+L P +C   DP+         CG PN L  GV +  L F  VGSGG  P
Sbjct: 171 PQLHPASCNSQDPLS--------CGGPNKLQIGVLLLGLCFLSVGSGGIRP 213


>gi|159464817|ref|XP_001690638.1| secreted protease and protease inhibitor [Chlamydomonas
           reinhardtii]
 gi|158280138|gb|EDP05897.1| secreted protease and protease inhibitor [Chlamydomonas
           reinhardtii]
          Length = 951

 Score = 36.6 bits (83), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 21/44 (47%), Positives = 25/44 (56%), Gaps = 2/44 (4%)

Query: 53  PDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPLPG 96
           P PVCG DG  Y   C  A C+GV   +LG C  G  G++P PG
Sbjct: 526 PSPVCGSDGNQYTNAC-EAWCSGVLSWRLGTCP-GGNGTSPSPG 567



 Score = 34.3 bits (77), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 20/39 (51%), Positives = 21/39 (53%), Gaps = 1/39 (2%)

Query: 55  PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
           PVCG DG TY   C  A CAGVK  K G C    G + P
Sbjct: 463 PVCGSDGRTYSNPC-EANCAGVKSYKAGECTAQGGVNTP 500


>gi|18409391|ref|NP_564979.1| nitrate transporter 1.7 [Arabidopsis thaliana]
 gi|75158815|sp|Q8RX77.1|PTR21_ARATH RecName: Full=Nitrate transporter 1.7
 gi|19699104|gb|AAL90918.1| At1g69870/T17F3_10 [Arabidopsis thaliana]
 gi|21928025|gb|AAM78041.1| At1g69870/T17F3_10 [Arabidopsis thaliana]
 gi|332196872|gb|AEE34993.1| nitrate transporter 1.7 [Arabidopsis thaliana]
          Length = 620

 Score = 36.6 bits (83), Expect = 2.2,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 8/51 (15%)

Query: 43  PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
           P+L P +C   DP+         CG PN L  GV +  L F  VGSGG  P
Sbjct: 147 PQLHPASCNSQDPLS--------CGGPNKLQIGVLLLGLCFLSVGSGGIRP 189


>gi|256549336|gb|ACU83222.1| serine proteinase inhibitor [Ruditapes philippinarum]
          Length = 134

 Score = 36.2 bits (82), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 18/32 (56%), Positives = 22/32 (68%), Gaps = 1/32 (3%)

Query: 55  PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEV 86
           PVCGVDG+TY   C N  CA V++A  G C+V
Sbjct: 95  PVCGVDGITYGNDC-NGKCANVQLAHTGECDV 125


>gi|297790444|ref|XP_002863113.1| proton-dependent oligopeptide transport family protein [Arabidopsis
           lyrata subsp. lyrata]
 gi|297308935|gb|EFH39372.1| proton-dependent oligopeptide transport family protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 619

 Score = 36.2 bits (82), Expect = 2.3,   Method: Composition-based stats.
 Identities = 21/51 (41%), Positives = 25/51 (49%), Gaps = 8/51 (15%)

Query: 43  PELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAP 93
           P+L P +C   DP+         CG PN L  GV +  L F  VGSGG  P
Sbjct: 146 PQLHPASCNSQDPLS--------CGGPNKLQFGVLLLGLCFLSVGSGGIRP 188


>gi|326431932|gb|EGD77502.1| kunitz protease inhibitor 1 [Salpingoeca sp. ATCC 50818]
          Length = 1724

 Score = 35.8 bits (81), Expect = 3.4,   Method: Composition-based stats.
 Identities = 18/30 (60%), Positives = 20/30 (66%), Gaps = 1/30 (3%)

Query: 56   VCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
            VCG DG TY   C +A CAGVKVA  G C+
Sbjct: 1333 VCGADGKTYPSAC-HARCAGVKVAYRGACQ 1361


>gi|313241335|emb|CBY33610.1| unnamed protein product [Oikopleura dioica]
          Length = 1045

 Score = 35.8 bits (81), Expect = 3.5,   Method: Composition-based stats.
 Identities = 21/44 (47%), Positives = 24/44 (54%), Gaps = 9/44 (20%)

Query: 46  CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSG 89
           CP+     +PVCG DGVTY     N+LC    V K   CE GSG
Sbjct: 246 CPLCPAENNPVCGSDGVTY-----NSLC----VMKRTNCEFGSG 280


>gi|405951044|gb|EKC18991.1| Serine protease inhibitor dipetalogastin [Crassostrea gigas]
          Length = 122

 Score = 35.8 bits (81), Expect = 3.5,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 31/71 (43%), Gaps = 3/71 (4%)

Query: 38  ARFTKPELCPINCFRPD---PVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGSGGSAPL 94
           AR      CP NC  PD   PVCG DGVTY   C      G+   ++G C+ G    A  
Sbjct: 42  ARIISNGECPRNCVCPDHYRPVCGSDGVTYPNRCVAGCINGIYDFQVGECQEGGEEKAKG 101

Query: 95  PGQALLLVHIV 105
                LL  I+
Sbjct: 102 THSNSLLDQIL 112


>gi|307110408|gb|EFN58644.1| hypothetical protein CHLNCDRAFT_140892 [Chlorella variabilis]
          Length = 404

 Score = 35.8 bits (81), Expect = 3.7,   Method: Composition-based stats.
 Identities = 24/67 (35%), Positives = 30/67 (44%), Gaps = 4/67 (5%)

Query: 27  VQSTQESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEV 86
           V++ Q + G          CP+N     PVCG +G+TY   C  A  AG  V   G C V
Sbjct: 279 VEAAQGAVGTTGAVGSAVACPLNLM---PVCGANGMTYDNECL-AEAAGTTVVAEGPCTV 334

Query: 87  GSGGSAP 93
             G S P
Sbjct: 335 PMGESGP 341


>gi|162455950|ref|YP_001618317.1| protease inhibitor [Sorangium cellulosum So ce56]
 gi|161166532|emb|CAN97837.1| putative secreted Protease inhibitor [Sorangium cellulosum So ce56]
          Length = 231

 Score = 35.4 bits (80), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 22/43 (51%), Positives = 24/43 (55%), Gaps = 4/43 (9%)

Query: 42  KPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           KP+ CP    + DPVCG DG TY  GC  A  AGV V   G C
Sbjct: 101 KPDACPQ---QYDPVCGCDGRTYSNGCVAAT-AGVSVRSAGEC 139


>gi|313222694|emb|CBY41696.1| unnamed protein product [Oikopleura dioica]
          Length = 290

 Score = 35.0 bits (79), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 5/81 (6%)

Query: 5   SLANRLIAVLIFFLALNIGLQLVQSTQESDGVCARFTKPELCPINC-FRPDPVCGVDGVT 63
           +  NR + +L    A   G +  +  + +DGVC + T+ + C   C    DP+CG DG+ 
Sbjct: 174 TYGNRDMFILAKCDAQRDGNKNARDWEATDGVC-QLTRVQRCNKACPMINDPICGSDGIR 232

Query: 64  YWCGC---PNALCAGVKVAKL 81
           Y   C     A C+G ++ +L
Sbjct: 233 YANECVMESVAACSGEEITRL 253


>gi|71063300|gb|AAZ22318.1| mantle protein 9 [Pinctada fucata]
          Length = 209

 Score = 35.0 bits (79), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 17/32 (53%), Positives = 21/32 (65%)

Query: 54 DPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
          +P CGVDG TY   C   +CAGV++A  G CE
Sbjct: 39 NPQCGVDGRTYSNPCSARVCAGVEIAYPGRCE 70


>gi|197105034|ref|YP_002130411.1| kazal-type serine protease inhibitor domain [Phenylobacterium
           zucineum HLK1]
 gi|196478454|gb|ACG77982.1| kazal-type serine protease inhibitor domain [Phenylobacterium
           zucineum HLK1]
          Length = 132

 Score = 35.0 bits (79), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 6/55 (10%)

Query: 31  QESDGVCARFTKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
           Q++ GVC +  KPE+C        PVCG DG TY   C  A  AGV +A  G C+
Sbjct: 82  QDAAGVCTK--KPEICTKEYM---PVCGCDGQTYGNAC-EADRAGVNIASTGACK 130


>gi|242025588|ref|XP_002433206.1| conserved hypothetical protein [Pediculus humanus corporis]
 gi|212518747|gb|EEB20468.1| conserved hypothetical protein [Pediculus humanus corporis]
          Length = 864

 Score = 35.0 bits (79), Expect = 5.9,   Method: Composition-based stats.
 Identities = 21/68 (30%), Positives = 31/68 (45%), Gaps = 16/68 (23%)

Query: 42  KPELCPI-----NCFRP--DPVCGVDGVTYW----CGCPNA-----LCAGVKVAKLGFCE 85
           +P+  P+     NC R    P+CG+DG TY+     GC NA     +   V  A +G+C 
Sbjct: 659 QPDFHPVCNTTCNCDREKFSPICGIDGKTYFSACHAGCRNATLVDGMVVEVSTASIGYCN 718

Query: 86  VGSGGSAP 93
           +      P
Sbjct: 719 LDCSNFVP 726


>gi|290988085|ref|XP_002676752.1| predicted protein [Naegleria gruberi]
 gi|284090356|gb|EFC44008.1| predicted protein [Naegleria gruberi]
          Length = 304

 Score = 35.0 bits (79), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 24/40 (60%), Gaps = 4/40 (10%)

Query: 46  CPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
           CP+  +RP  VCGV+G+TY   C  A CA V +   G C+
Sbjct: 247 CPVE-YRP--VCGVNGLTYGSAC-TARCADVSIHYYGVCK 282


>gi|397589373|gb|EJK54637.1| hypothetical protein THAOC_25716, partial [Thalassiosira oceanica]
          Length = 175

 Score = 34.7 bits (78), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 17/31 (54%), Positives = 20/31 (64%), Gaps = 1/31 (3%)

Query: 54  DPVCGVDGVTYWCGCPNALCAGVKVAKLGFC 84
           D VCG DG TYW  C +A  +GV+VA  G C
Sbjct: 100 DQVCGADGGTYWNDC-HATQSGVEVAYGGRC 129


>gi|82702295|ref|YP_411861.1| proteinase inhibitor I1, Kazal [Nitrosospira multiformis ATCC
           25196]
 gi|82410360|gb|ABB74469.1| proteinase inhibitor I1, Kazal [Nitrosospira multiformis ATCC
           25196]
          Length = 263

 Score = 34.7 bits (78), Expect = 8.4,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 34/67 (50%), Gaps = 11/67 (16%)

Query: 41  TKPELCPINCFRPDPVCGVDGVTYWCGCPNALCAGVKVAKLGFCEVGS----GGSAPLP- 95
           TKP +CP   F+P  VCG DG TY   C  A  AGV V   G C+       GG A +P 
Sbjct: 177 TKPTICP-QIFKP--VCGCDGKTYSNSC-TAAAAGVSVEHEGECKKSEPQACGGIAGIPC 232

Query: 96  --GQALL 100
             GQA +
Sbjct: 233 PEGQACM 239


>gi|159476856|ref|XP_001696527.1| hypothetical protein CHLREDRAFT_93612 [Chlamydomonas reinhardtii]
 gi|158282752|gb|EDP08504.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 87

 Score = 34.3 bits (77), Expect = 9.7,   Method: Compositional matrix adjust.
 Identities = 20/41 (48%), Positives = 21/41 (51%), Gaps = 2/41 (4%)

Query: 46 CPINCFRP-DPVCGVDGVTYWCGCPNALCAGVKVAKLGFCE 85
          C   C R   P CG DG TY   C  A C GV+VA  G CE
Sbjct: 47 CRAGCKRAFKPACGADGRTYASACL-AKCQGVQVAHHGPCE 86


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.329    0.145    0.477 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,813,026,710
Number of Sequences: 23463169
Number of extensions: 68296165
Number of successful extensions: 192023
Number of sequences better than 100.0: 151
Number of HSP's better than 100.0 without gapping: 47
Number of HSP's successfully gapped in prelim test: 104
Number of HSP's that attempted gapping in prelim test: 191601
Number of HSP's gapped (non-prelim): 521
length of query: 119
length of database: 8,064,228,071
effective HSP length: 86
effective length of query: 33
effective length of database: 6,046,395,537
effective search space: 199531052721
effective search space used: 199531052721
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.8 bits)
S2: 69 (31.2 bits)