All of lore.kernel.org
 help / color / mirror / Atom feed
From: Alexei Starovoitov <alexei.starovoitov@gmail.com>
To: Andrii Nakryiko <andrii.nakryiko@gmail.com>
Cc: Andrii Nakryiko <andrii@kernel.org>,
	bpf@vger.kernel.org, ast@kernel.org, daniel@iogearbox.net,
	kernel-team@fb.com
Subject: Re: [PATCH bpf-next 7/7] bpf: unify PTR_TO_MAP_{KEY,VALUE} with default case in regsafe()
Date: Thu, 29 Dec 2022 18:19:17 -0800	[thread overview]
Message-ID: <20221230021917.yuvm4g7sjj7vy5qc@MacBook-Pro-6.local> (raw)
In-Reply-To: <CAEf4BzaT_kp-F3QMeGqpCf8ekhmDVjHwV4y7fYtxjWPFq1yhSg@mail.gmail.com>

On Thu, Dec 29, 2022 at 01:59:49PM -0800, Andrii Nakryiko wrote:
> On Tue, Dec 27, 2022 at 6:00 PM Alexei Starovoitov
> <alexei.starovoitov@gmail.com> wrote:
> >
> > On Thu, Dec 22, 2022 at 09:49:21PM -0800, Andrii Nakryiko wrote:
> > > Make default case in regsafe() safer. Instead of doing byte-by-byte
> >
> > I love the patches 1-6, but this one is not making it safer.
> > It looks to be going less safe route.
> > More below.
> >
> > > comparison, take into account ID remapping and also range and var_off
> >
> > ID remapping is handled by the patch 6 in regs_exact().
> > This patch adds range and var_off check as default.
> > Which might not be correct in the future.
> >
> > > checks. For most of registers range and var_off will be zero (not set),
> > > which doesn't matter. For some, like PTR_TO_MAP_{KEY,VALUE}, this
> > > generalized logic is exactly matching what regsafe() was doing as
> > > a special case. But in any case, if register has id and/or ref_obj_id
> > > set, check it using check_ids() logic, taking into account idmap.
> >
> > That was already done in patch 6. So the commit log is misleading.
> > It's arguing that it's a benefit of this change while it was in the previous patch.
> 
> True, I think I had regs_exact() and regs_equals() change in one
> commit and split it at the last minute before submitting (I felt like
> patch #7 will be controversial ;) ), should have proofread messages
> more carefully. Sorry about that.
> 
> >
> > > With these changes, default case should be handling most registers more
> > > correctly, and even for future register would be a good default. For some
> > > special cases, like PTR_TO_PACKET, one would still need to implement extra
> > > checks (like special handling of reg->range) to get better state
> > > equivalence, but default logic shouldn't be wrong.
> >
> > PTR_TO_BTF_ID with var_off would be a counter example where
> > such default of comparing ranges and var_off would lead to issues.
> > Currently PTR_TO_BTF_ID doesn't allow var_off, but folks requested this support.
> > The range_within() logic is safe only for types like PTR_TO_MAP_KEY/VALUE
> > that start from zero and have uniform typeless blob of bytes.
> > PTR_TO_BTF_ID with var_off would be wrong to do with just range_within().
> 
> I'm trying to understand this future problem. I think this is the same
> issue that Kumar was trying to fix before, but when I asked for more
> specifics I didn't really get good answer of when this combined
> var_off and range_within() would be incorrect.
> 
> Do you mind showing (even if hypothetically) an example when
> var_off+range_within() won't work? I'm trying to understand this. We
> should either document why this is not safe, in general, or come to
> conclusion that it is safe. It's second time this comes up, so let's
> spend a bit of time getting to the bottom of this?

Kumar's example was for a constant. range_within is the same as equality
check, so we concluded it's safe for that case.
I'm worried about non uniformity of structs with types and general
comparison of ranges.
I guess you're arguing that if old state had wider range than every single
possible value from that range was already validated to be safe and
hence new narrow range is safe too.
It sounds logical, but it can get tricky with ranges and branch taken logic.
Consider something like:
R1=(min=2,max=8), R2=(min=1, max=10)
if (R1 within R2) // bpf prog is doing its own 'within'
  // branch taken kicks in
else
  // issues that were never checked

Now new state has:
R1=(min=4,max=6), R2=(min=5, max=5)

Both R1 and R2 of new state individually range_within of old safe state,
but together the prog may go to the unverified path.
Not sure whether it's practical today.
You asked for hypothetical, so here it goes :)
More gut feel than real issue.

> 
> >
> > SCALARS and PTR_TO_BTF_ID will likely dominate future bpf progs.
> > Keeping default as regs_exact (that does ID match) is safer default.
> 
> It's fine, though the point of this patch set was patch #7, enabling
> logic similar to PTR_TO_MAP_VALUE for PTR_TO_MEM and PTR_TO_BUF. I can
> send specific fixes for that, no problem. But as I said above, I'm
> really curious to understand what kind of situations will lead to
> unsafety if we do var_off+range_within checks.

PTR_TO_MEM and PTR_TO_BUF explicitly are likely ok despite my convoluted
example above.
I'm less sure about PTR_TO_BTF_ID. It could be ok.
Just feels safer to opt-in each type explicitly.

> 
> >
> > Having said all that the focus on safety should be balanced with focus on performance
> > of the verifier itself.
> > The regsafe is the hottest function.
> > That first memcmp used to be the hottest part of the whole verifier.
> 
> yeah, and it was done unconditionally even if not needed, which was
> kind of weird when I started looking at this. Probably some
> refactoring leftover.
> 
> > I suspect this refactoring won't change the perf profile, but we can optimize it.
> > Assuming that SCALAR, PTR_TO_BTF_ID and PTR_TO_MAP will be the majority of types
> > we can special case them and refactor comparison to only things
> > that matter to these types. var_off and min/max_value are the biggest part
> > of bpf_reg_state. They should be zero for PTR_TO_BTF_ID, but we already check
> > that in other parts of the verifier. There is no need to compare zeros again
> > in the hottest regsafe() function.
> > Same thing for SCALAR. Doing regs_exact() with big memcmp and then finer range_within()
> > on the same bytes is probably wasteful and can be optimized.
> > We might consider reshuffling bpf_reg_state fields again depending on cache line usage.
> > I suspect doing "smart" reg comparison we will be able to significantly
> > improve verification speed. Please consider for a follow up.
> 
> I agree. Perf wasn't the point for me (this is a preliminary for
> iterator stuff to improve state equivalence checks), so I didn't want
> to spend extra time on this (especially that benchmarking this
> properly is time consuming, as benchmarking under QEMU isn't
> representative (from me experiences with BPF ringbuf benchmarking).
> But I'll keep it on TODO list, either for me or anyone interested in
> contributing.

Thank you. Lorenz was pondering at it at some point, but lost interest. I guess.
I did a bunch of perf runs with our test_verif_scale* long ago.
The perf report used to be the same and stable for them.
processed_insn and number of states affects both memory and speed
and % wise have higher contribution to verifier speed, so any improvements
there help more. Optimizing the actual regsafe() is secondary.
Just something to put on todo list.

> >
> > I've applied the first 6 patches.
> 
> Cool, thanks, less patches to carry around. If you don't mind, let's
> look at this var_off concern in details. I can send
> PTR_TO_MEM-specific follow up fix, but if we can convince ourselves
> that generic logic is safe and future-proof, I'd rather do a generic
> change.

  reply	other threads:[~2022-12-30  2:20 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2022-12-23  5:49 [PATCH bpf-next 0/7] BPF verifier state equivalence checks improvements Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 1/7] bpf: teach refsafe() to take into account ID remapping Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 2/7] bpf: reorganize struct bpf_reg_state fields Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 3/7] bpf: generalize MAYBE_NULL vs non-MAYBE_NULL rule Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 4/7] bpf: reject non-exact register type matches in regsafe() Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 5/7] bpf: perform byte-by-byte comparison only when necessary " Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 6/7] bpf: fix regs_exact() logic in regsafe() to remap IDs correctly Andrii Nakryiko
2022-12-23  5:49 ` [PATCH bpf-next 7/7] bpf: unify PTR_TO_MAP_{KEY,VALUE} with default case in regsafe() Andrii Nakryiko
2022-12-28  2:00   ` Alexei Starovoitov
2022-12-29 21:59     ` Andrii Nakryiko
2022-12-30  2:19       ` Alexei Starovoitov [this message]
2023-01-03 22:04         ` Andrii Nakryiko
2023-01-04 22:35           ` Alexei Starovoitov
2023-01-04 23:03             ` Andrii Nakryiko
2023-01-05  0:14               ` Alexei Starovoitov
2023-01-11 19:08                 ` Andrii Nakryiko
2022-12-28  2:10 ` [PATCH bpf-next 0/7] BPF verifier state equivalence checks improvements patchwork-bot+netdevbpf

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20221230021917.yuvm4g7sjj7vy5qc@MacBook-Pro-6.local \
    --to=alexei.starovoitov@gmail.com \
    --cc=andrii.nakryiko@gmail.com \
    --cc=andrii@kernel.org \
    --cc=ast@kernel.org \
    --cc=bpf@vger.kernel.org \
    --cc=daniel@iogearbox.net \
    --cc=kernel-team@fb.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.