bpf.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Alexei Starovoitov <ast@fb.com>
To: Daniel Borkmann <daniel@iogearbox.net>,
	Alexei Starovoitov <alexei.starovoitov@gmail.com>
Cc: Alexei Starovoitov <ast@kernel.org>,
	"David S. Miller" <davem@davemloft.net>, X86 ML <x86@kernel.org>,
	Network Development <netdev@vger.kernel.org>,
	bpf <bpf@vger.kernel.org>, Kernel Team <Kernel-team@fb.com>
Subject: Re: [PATCH v3 bpf-next 06/11] bpf: implement accurate raw_tp context access via BTF
Date: Wed, 16 Oct 2019 23:52:02 +0000	[thread overview]
Message-ID: <543e4e69-ecc6-835b-7634-4c7d60d2c06d@fb.com> (raw)
In-Reply-To: <bb504159-48b1-93fc-8c38-5cef6b36e4d1@iogearbox.net>

On 10/16/19 3:08 PM, Daniel Borkmann wrote:
> On 10/16/19 11:28 PM, Alexei Starovoitov wrote:
>> On Wed, Oct 16, 2019 at 2:22 PM Daniel Borkmann <daniel@iogearbox.net> 
>> wrote:
>>> On 10/16/19 5:25 AM, Alexei Starovoitov wrote:
>>>> libbpf analyzes bpf C program, searches in-kernel BTF for given type 
>>>> name
>>>> and stores it into expected_attach_type.
>>>> The kernel verifier expects this btf_id to point to something like:
>>>> typedef void (*btf_trace_kfree_skb)(void *, struct sk_buff *skb, 
>>>> void *loc);
>>>> which represents signature of raw_tracepoint "kfree_skb".
>>>>
>>>> Then btf_ctx_access() matches ctx+0 access in bpf program with 'skb'
>>>> and 'ctx+8' access with 'loc' arguments of "kfree_skb" tracepoint.
>>>> In first case it passes btf_id of 'struct sk_buff *' back to the 
>>>> verifier core
>>>> and 'void *' in second case.
>>>>
>>>> Then the verifier tracks PTR_TO_BTF_ID as any other pointer type.
>>>> Like PTR_TO_SOCKET points to 'struct bpf_sock',
>>>> PTR_TO_TCP_SOCK points to 'struct bpf_tcp_sock', and so on.
>>>> PTR_TO_BTF_ID points to in-kernel structs.
>>>> If 1234 is btf_id of 'struct sk_buff' in vmlinux's BTF
>>>> then PTR_TO_BTF_ID#1234 points to one of in kernel skbs.
>>>>
>>>> When PTR_TO_BTF_ID#1234 is dereferenced (like r2 = *(u64 *)r1 + 32)
>>>> the btf_struct_access() checks which field of 'struct sk_buff' is
>>>> at offset 32. Checks that size of access matches type definition
>>>> of the field and continues to track the dereferenced type.
>>>> If that field was a pointer to 'struct net_device' the r2's type
>>>> will be PTR_TO_BTF_ID#456. Where 456 is btf_id of 'struct net_device'
>>>> in vmlinux's BTF.
>>>>
>>>> Such verifier analysis prevents "cheating" in BPF C program.
>>>> The program cannot cast arbitrary pointer to 'struct sk_buff *'
>>>> and access it. C compiler would allow type cast, of course,
>>>> but the verifier will notice type mismatch based on BPF assembly
>>>> and in-kernel BTF.
>>>>
>>>> Signed-off-by: Alexei Starovoitov <ast@kernel.org>
>>>
>>> Overall set looks great!
>>>
>>> [...]
>>>> +int btf_struct_access(struct bpf_verifier_log *log,
>>>> +                   const struct btf_type *t, int off, int size,
>>>> +                   enum bpf_access_type atype,
>>>> +                   u32 *next_btf_id)
>>>> +{
>>>> +     const struct btf_member *member;
>>>> +     const struct btf_type *mtype;
>>>> +     const char *tname, *mname;
>>>> +     int i, moff = 0, msize;
>>>> +
>>>> +again:
>>>> +     tname = __btf_name_by_offset(btf_vmlinux, t->name_off);
>>>
>>> More of a high-level question wrt btf_ctx_access(), is there a reason 
>>> the ctx
>>> access is only done for raw_tp? I presume kprobes is still on todo 
>>> (?), what
>>> about uprobes which also have pt_regs and could benefit from this 
>>> work, but is
>>> not fixed to btf_vmlinux to search its ctx type.
>>
>> Optimized kprobes via ftrace entry point are on immediate todo list
>> to follow up. I'm still debating on the best way to handle it.
>> uprobes - I haven't though about. Likely necessary as well.
>> Not sure what types to give to pt_regs yet.
>>
>>> I presume BPF_LDX | BPF_PROBE_MEM | BPF_* would need no additional 
>>> encoding,
>>> but JIT emission would have to differ depending on the prog type.
>>
>> you mean for kprobes/uprobes? Why would it need to be different?
>> The idea was to keep LDX|PROBE_MEM as normal LDX|MEM load as much as 
>> possible.
> 
> Agree, makes sense.
> 
>> The only difference vs normal load is to populate extable which is
>> arch dependent.
> 
> Wouldn't you also need to switch to USER_DS similarly to what 
> probe_kernel_read()
> vs probe_user_read() differentiates?

No. I don't think we should.
Here we're reading only kernel memory and shouldn't be
messing with addr_limit.
No stac/clac and no access_ok() either.
It's kernel memory being read.
set_fs(KERNEL_DS) matters when access_ok() and getuser()
are used by callee that normally take user address
while caller is passing kernel address.
Here is no such thing.


  reply	other threads:[~2019-10-16 23:52 UTC|newest]

Thread overview: 24+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2019-10-16  3:24 [PATCH v3 bpf-next 00/11] bpf: revolutionize bpf tracing Alexei Starovoitov
2019-10-16  3:24 ` [PATCH v3 bpf-next 01/11] bpf: add typecast to raw_tracepoints to help BTF generation Alexei Starovoitov
2019-10-16  3:24 ` [PATCH v3 bpf-next 02/11] bpf: add typecast to bpf helpers " Alexei Starovoitov
2019-10-16  3:24 ` [PATCH v3 bpf-next 03/11] bpf: process in-kernel BTF Alexei Starovoitov
2019-10-16  3:24 ` [PATCH v3 bpf-next 04/11] bpf: add attach_btf_id attribute to program load Alexei Starovoitov
2019-10-16 19:45   ` Andrii Nakryiko
2019-10-16 19:50     ` Alexei Starovoitov
2019-10-16  3:24 ` [PATCH v3 bpf-next 05/11] libbpf: auto-detect btf_id of BTF-based raw_tracepoints Alexei Starovoitov
2019-10-16 19:49   ` Andrii Nakryiko
2019-10-16  3:25 ` [PATCH v3 bpf-next 06/11] bpf: implement accurate raw_tp context access via BTF Alexei Starovoitov
2019-10-16 20:09   ` Andrii Nakryiko
2019-10-16 21:21   ` Daniel Borkmann
2019-10-16 21:28     ` Alexei Starovoitov
2019-10-16 22:08       ` Daniel Borkmann
2019-10-16 23:52         ` Alexei Starovoitov [this message]
2019-10-16  3:25 ` [PATCH v3 bpf-next 07/11] bpf: attach raw_tp program with BTF via type name Alexei Starovoitov
2019-10-16 20:13   ` Andrii Nakryiko
2019-10-16  3:25 ` [PATCH v3 bpf-next 08/11] bpf: add support for BTF pointers to interpreter Alexei Starovoitov
2019-10-16  3:25 ` [PATCH v3 bpf-next 09/11] bpf: add support for BTF pointers to x86 JIT Alexei Starovoitov
2019-10-16 20:15   ` Andrii Nakryiko
2019-10-16  3:25 ` [PATCH v3 bpf-next 10/11] bpf: check types of arguments passed into helpers Alexei Starovoitov
2019-10-16  3:25 ` [PATCH v3 bpf-next 11/11] selftests/bpf: add kfree_skb raw_tp test Alexei Starovoitov
2019-10-16 18:01 ` [PATCH v3 bpf-next 00/11] bpf: revolutionize bpf tracing Martin Lau
2019-10-17 15:14 ` Daniel Borkmann

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=543e4e69-ecc6-835b-7634-4c7d60d2c06d@fb.com \
    --to=ast@fb.com \
    --cc=Kernel-team@fb.com \
    --cc=alexei.starovoitov@gmail.com \
    --cc=ast@kernel.org \
    --cc=bpf@vger.kernel.org \
    --cc=daniel@iogearbox.net \
    --cc=davem@davemloft.net \
    --cc=netdev@vger.kernel.org \
    --cc=x86@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).