[PATCH v3 bpf-next 0/2] Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk

All of lore.kernel.org
 help / color / mirror / Atom feed

* [PATCH v3 bpf-next 0/2] Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
@ 2019-03-02 16:10 Martin KaFai Lau
  2019-03-02 16:10 ` [PATCH v3 bpf-next 1/2] bpf: " Martin KaFai Lau
  2019-03-02 16:10 ` [PATCH v3 bpf-next 2/2] bpf: Test ref release issue in bpf_tcp_sock and bpf_sk_fullsock Martin KaFai Lau
  0 siblings, 2 replies; 10+ messages in thread
From: Martin KaFai Lau @ 2019-03-02 16:10 UTC (permalink / raw)
  To: netdev; +Cc: Alexei Starovoitov, Daniel Borkmann, kernel-team

This set addresses issue about accessing invalid
ptr returned from bpf_tcp_sock() and bpf_sk_fullsock()
after bpf_sk_release().

v3:
- reset reg->refcount_id for the is_null case in mark_ptr_or_null_reg()

v2:
- Remove refcount_id arg from release_reference() because
  id == refcount_id
- Add a WARN_ON_ONCE to mark_ptr_or_null_regs() to catch
  an internal verifier bug.

Martin KaFai Lau (2):
  bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to
    bpf_sk_release
  bpf: Test ref release issue in bpf_tcp_sock and bpf_sk_fullsock.

 include/linux/bpf_verifier.h                  |   9 ++
 kernel/bpf/verifier.c                         | 107 +++++++++++-------
 .../selftests/bpf/verifier/ref_tracking.c     |  73 ++++++++++++
 3 files changed, 150 insertions(+), 39 deletions(-)

-- 
2.17.1


^ permalink raw reply	[flat|nested] 10+ messages in thread

* [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-02 16:10 [PATCH v3 bpf-next 0/2] Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release Martin KaFai Lau
@ 2019-03-02 16:10 ` Martin KaFai Lau
  2019-03-02 18:03   ` Alexei Starovoitov
  2019-03-02 16:10 ` [PATCH v3 bpf-next 2/2] bpf: Test ref release issue in bpf_tcp_sock and bpf_sk_fullsock Martin KaFai Lau
  1 sibling, 1 reply; 10+ messages in thread
From: Martin KaFai Lau @ 2019-03-02 16:10 UTC (permalink / raw)
  To: netdev; +Cc: Alexei Starovoitov, Daniel Borkmann, kernel-team, Lorenz Bauer

Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
can still be accessed after bpf_sk_release(sk).
Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
This patch addresses them together.

A simple reproducer looks like this:

sk = bpf_sk_lookup_tcp();
/* if (!sk) ... */
tp = bpf_tcp_sock(sk);
/* if (!tp) ... */
bpf_sk_release(sk);
snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */

The problem is the verifier did not scrub the register's states of
the tcp_sock ptr (tp) after bpf_sk_release(sk).

[ Note that when calling bpf_tcp_sock(sk), the sk is not always
  refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
  fine for this case. ]

Currently, the verifier does not track if a helper's return ptr (in REG_0)
is "carry"-ing one of its argument's refcount status. To carry this info,
the reg1->id needs to be stored in reg0.  The reg0->id has already
been used for NULL checking purpose.  Hence, a new "refcount_id"
is needed in "struct bpf_reg_state".

With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
all reg states which has a refcount_id match.  It is done with the changes
in release_reg_references().

When acquiring and releasing a refcount, the reg->id is still used.
Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
example.

Misc change notes:
- With the new refcount_id, reg_is_refcounted() test can now be
  done with "reg->refcount_id && reg->id == reg->refcount_id" instead of
  testing the ptr type.

  The type_is_refcounted() and type_is_refcounted_or_null()
  are no longer needed, so removed.

- An anonymous struct is added to bpf_call_arg_meta to store
  the reg->id and reg->refcount_id of the arg.  Otherwise,
  they will be unavailable after check_helper_call()
  has cleared all CALLER_SAVED_REGS.

- The check_func_arg() can only allow one refcount-ed arg.  It is
  guaranteed by check_refcount_ok() which ensures at most one arg can be
  refcount-ed.  Hence, it is a verifier internal error if >1 refcount arg
  found in check_func_arg().

- The check_func_arg() also complains if a "is_acquire_function(func_id)"
  helper is having a refcount-ed arg.  No func_id is doing this now
  and should have been rejected earlier, so it is treated as
  verifier internal error also.

- In check_func_arg(), the "!reg->id" check is removed under
  the ARG_PTR_TO_SOCKET case.  It is because a PTR_TO_SOCKET
  can be obtained from bpf_sk_fullsock() and it does not
  take a refcount.  The verifier will still complain during
  release_reference() but it does not treat it as a
  verifier internal error anymore.

- In release_reference(), release_reference_state() is called
  first to ensure a match on "reg->id" can be found before
  scrubbing the reg states with release_reg_references().

Fixes: 655a51e536c0 ("bpf: Add struct bpf_tcp_sock and BPF_FUNC_tcp_sock")
Cc: Lorenz Bauer <lmb@cloudflare.com>
Reported-by: Lorenz Bauer <lmb@cloudflare.com>
Signed-off-by: Martin KaFai Lau <kafai@fb.com>
---
 include/linux/bpf_verifier.h |   9 +++
 kernel/bpf/verifier.c        | 107 ++++++++++++++++++++++-------------
 2 files changed, 77 insertions(+), 39 deletions(-)

diff --git a/include/linux/bpf_verifier.h b/include/linux/bpf_verifier.h
index 69f7a3449eda..b7698d0534cb 100644
--- a/include/linux/bpf_verifier.h
+++ b/include/linux/bpf_verifier.h
@@ -66,6 +66,15 @@ struct bpf_reg_state {
 	 * same reference to the socket, to determine proper reference freeing.
 	 */
 	u32 id;
+	/* For PTR_TO_SOCKET and PTR_TO_TCP_SOCK, this ptr may not actually
+	 * hold a refcount of a socket but instead it is a ptr
+	 * returned from a helper which is based on its refcount-ed
+	 * ptr argument (e.g. bpf_tcp_sock()).
+	 * "refcount_id" stores which refcount-ed argument it
+	 * originally derived from.  When this original argument's
+	 * refcount is released, this ptr will also be invalidated.
+	 */
+	u32 refcount_id;
 	/* For scalar types (SCALAR_VALUE), this represents our knowledge of
 	 * the actual value.
 	 * For pointer types, this represents the variable part of the offset
diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
index 1b9496c41383..31c278fadaaa 100644
--- a/kernel/bpf/verifier.c
+++ b/kernel/bpf/verifier.c
@@ -212,7 +212,10 @@ struct bpf_call_arg_meta {
 	int access_size;
 	s64 msize_smax_value;
 	u64 msize_umax_value;
-	int ptr_id;
+	struct {
+		int id;
+		int refcount_id;
+	} refcount_reg;
 	int func_id;
 };
 
@@ -346,19 +349,9 @@ static bool reg_type_may_be_null(enum bpf_reg_type type)
 	       type == PTR_TO_TCP_SOCK_OR_NULL;
 }
 
-static bool type_is_refcounted(enum bpf_reg_type type)
-{
-	return type == PTR_TO_SOCKET;
-}
-
-static bool type_is_refcounted_or_null(enum bpf_reg_type type)
-{
-	return type == PTR_TO_SOCKET || type == PTR_TO_SOCKET_OR_NULL;
-}
-
 static bool reg_is_refcounted(const struct bpf_reg_state *reg)
 {
-	return type_is_refcounted(reg->type);
+	return reg->refcount_id && reg->id == reg->refcount_id;
 }
 
 static bool reg_may_point_to_spin_lock(const struct bpf_reg_state *reg)
@@ -367,14 +360,10 @@ static bool reg_may_point_to_spin_lock(const struct bpf_reg_state *reg)
 		map_value_has_spin_lock(reg->map_ptr);
 }
 
-static bool reg_is_refcounted_or_null(const struct bpf_reg_state *reg)
-{
-	return type_is_refcounted_or_null(reg->type);
-}
-
 static bool arg_type_is_refcounted(enum bpf_arg_type type)
 {
-	return type == ARG_PTR_TO_SOCKET;
+	return type == ARG_PTR_TO_SOCKET ||
+		type == ARG_PTR_TO_SOCK_COMMON;
 }
 
 /* Determine whether the function releases some resources allocated by another
@@ -392,6 +381,12 @@ static bool is_acquire_function(enum bpf_func_id func_id)
 		func_id == BPF_FUNC_sk_lookup_udp;
 }
 
+static bool is_refcount_carrying_function(enum bpf_func_id func_id)
+{
+	return func_id == BPF_FUNC_tcp_sock ||
+		func_id == BPF_FUNC_sk_fullsock;
+}
+
 /* string representation of 'enum bpf_reg_type' */
 static const char * const reg_type_str[] = {
 	[NOT_INIT]		= "?",
@@ -465,7 +460,8 @@ static void print_verifier_state(struct bpf_verifier_env *env,
 			if (t == PTR_TO_STACK)
 				verbose(env, ",call_%d", func(env, reg)->callsite);
 		} else {
-			verbose(env, "(id=%d", reg->id);
+			verbose(env, "(id=%d refcount_id=%d", reg->id,
+				reg->refcount_id);
 			if (t != SCALAR_VALUE)
 				verbose(env, ",off=%d", reg->off);
 			if (type_is_pkt_pointer(t))
@@ -2418,12 +2414,6 @@ static int check_func_arg(struct bpf_verifier_env *env, u32 regno,
 		expected_type = PTR_TO_SOCKET;
 		if (type != expected_type)
 			goto err_type;
-		if (meta->ptr_id || !reg->id) {
-			verbose(env, "verifier internal error: mismatched references meta=%d, reg=%d\n",
-				meta->ptr_id, reg->id);
-			return -EFAULT;
-		}
-		meta->ptr_id = reg->id;
 	} else if (arg_type == ARG_PTR_TO_SPIN_LOCK) {
 		if (meta->func_id == BPF_FUNC_spin_lock) {
 			if (process_spin_lock(env, regno, true))
@@ -2532,6 +2522,26 @@ static int check_func_arg(struct bpf_verifier_env *env, u32 regno,
 					      zero_size_allowed, meta);
 	}
 
+	if (reg->refcount_id) {
+		if (meta->refcount_reg.refcount_id) {
+			verbose(env, "verifier internal error: more than one arg with refcount_id R%d %u %u\n",
+				regno, reg->refcount_id,
+				meta->refcount_reg.refcount_id);
+			return -EFAULT;
+		}
+
+		if (is_acquire_function(meta->func_id)) {
+			verbose(env,
+				"verifier internal error: func %s#%d taking an already refcount-ed arg R%d\n",
+				func_id_name(meta->func_id), meta->func_id,
+				regno);
+			return -EFAULT;
+		}
+
+		meta->refcount_reg.id = reg->id;
+		meta->refcount_reg.refcount_id = reg->refcount_id;
+	}
+
 	return err;
 err_type:
 	verbose(env, "R%d type=%s expected=%s\n", regno,
@@ -2805,13 +2815,13 @@ static void release_reg_references(struct bpf_verifier_env *env,
 	int i;
 
 	for (i = 0; i < MAX_BPF_REG; i++)
-		if (regs[i].id == id)
+		if (regs[i].refcount_id == id)
 			mark_reg_unknown(env, regs, i);
 
 	bpf_for_each_spilled_reg(i, state, reg) {
 		if (!reg)
 			continue;
-		if (reg_is_refcounted(reg) && reg->id == id)
+		if (reg->refcount_id == id)
 			__mark_reg_unknown(reg);
 	}
 }
@@ -2819,16 +2829,20 @@ static void release_reg_references(struct bpf_verifier_env *env,
 /* The pointer with the specified id has released its reference to kernel
  * resources. Identify all copies of the same pointer and clear the reference.
  */
-static int release_reference(struct bpf_verifier_env *env,
-			     struct bpf_call_arg_meta *meta)
+static int release_reference(struct bpf_verifier_env *env, int id)
 {
 	struct bpf_verifier_state *vstate = env->cur_state;
+	int err;
 	int i;
 
+	err = release_reference_state(cur_func(env), id);
+	if (err)
+		return err;
+
 	for (i = 0; i <= vstate->curframe; i++)
-		release_reg_references(env, vstate->frame[i], meta->ptr_id);
+		release_reg_references(env, vstate->frame[i], id);
 
-	return release_reference_state(cur_func(env), meta->ptr_id);
+	return 0;
 }
 
 static int check_func_call(struct bpf_verifier_env *env, struct bpf_insn *insn,
@@ -3093,7 +3107,7 @@ static int check_helper_call(struct bpf_verifier_env *env, int func_id, int insn
 			return err;
 		}
 	} else if (is_release_function(func_id)) {
-		err = release_reference(env, &meta);
+		err = release_reference(env, meta.refcount_reg.id);
 		if (err) {
 			verbose(env, "func %s#%d reference has not been acquired before\n",
 				func_id_name(func_id), func_id);
@@ -3156,6 +3170,7 @@ static int check_helper_call(struct bpf_verifier_env *env, int func_id, int insn
 				return id;
 			/* For release_reference() */
 			regs[BPF_REG_0].id = id;
+			regs[BPF_REG_0].refcount_id = id;
 		} else {
 			/* For mark_ptr_or_null_reg() */
 			regs[BPF_REG_0].id = ++env->id_gen;
@@ -3170,6 +3185,9 @@ static int check_helper_call(struct bpf_verifier_env *env, int func_id, int insn
 		return -EINVAL;
 	}
 
+	if (is_refcount_carrying_function(func_id))
+		regs[BPF_REG_0].refcount_id = meta.refcount_reg.refcount_id;
+
 	do_refine_retval_range(regs, fn->ret_type, func_id, &meta);
 
 	err = check_map_func_compatibility(env, meta.map_ptr, func_id);
@@ -4665,11 +4683,22 @@ static void mark_ptr_or_null_reg(struct bpf_func_state *state,
 		} else if (reg->type == PTR_TO_TCP_SOCK_OR_NULL) {
 			reg->type = PTR_TO_TCP_SOCK;
 		}
-		if (is_null || !(reg_is_refcounted(reg) ||
-				 reg_may_point_to_spin_lock(reg))) {
-			/* We don't need id from this point onwards anymore,
-			 * thus we should better reset it, so that state
-			 * pruning has chances to take effect.
+		if (is_null) {
+			/* We don't need id and refcount_id from this point
+			 * onwards anymore, thus we should better reset it,
+			 * so that state pruning has chances to take effect.
+			 */
+			reg->id = 0;
+			reg->refcount_id = 0;
+		} else if (!reg_is_refcounted(reg) &&
+			   !reg_may_point_to_spin_lock(reg)) {
+			/* For not-NULL ptr, reg->refcount_id will be reset
+			 * in release_reg_references().
+			 *
+			 * reg->id is still used by refcounted ptr
+			 * and spin_lock ptr for tracking purpose.
+			 * Other than these two ptr type,
+			 * reg->id can also be reset.
 			 */
 			reg->id = 0;
 		}
@@ -4687,8 +4716,8 @@ static void mark_ptr_or_null_regs(struct bpf_verifier_state *vstate, u32 regno,
 	u32 id = regs[regno].id;
 	int i, j;
 
-	if (reg_is_refcounted_or_null(&regs[regno]) && is_null)
-		release_reference_state(state, id);
+	if (reg_is_refcounted(&regs[regno]) && is_null)
+		WARN_ON_ONCE(release_reference_state(state, id));
 
 	for (i = 0; i < MAX_BPF_REG; i++)
 		mark_ptr_or_null_reg(state, &regs[i], id, is_null);
-- 
2.17.1


^ permalink raw reply related	[flat|nested] 10+ messages in thread

* [PATCH v3 bpf-next 2/2] bpf: Test ref release issue in bpf_tcp_sock and bpf_sk_fullsock.
  2019-03-02 16:10 [PATCH v3 bpf-next 0/2] Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release Martin KaFai Lau
  2019-03-02 16:10 ` [PATCH v3 bpf-next 1/2] bpf: " Martin KaFai Lau
@ 2019-03-02 16:10 ` Martin KaFai Lau
  1 sibling, 0 replies; 10+ messages in thread
From: Martin KaFai Lau @ 2019-03-02 16:10 UTC (permalink / raw)
  To: netdev; +Cc: Alexei Starovoitov, Daniel Borkmann, kernel-team, Lorenz Bauer

Adding verifier tests to ensure the ptr returned from
bpf_tcp_sock() and bpf_sk_fullsock() cannot be accessed
after bpf_sk_release() is called.  It is derived from a
reproducer test from Lorenz Bauer.

Cc: Lorenz Bauer <lmb@cloudflare.com>
Signed-off-by: Martin KaFai Lau <kafai@fb.com>
---
 .../selftests/bpf/verifier/ref_tracking.c     | 73 +++++++++++++++++++
 1 file changed, 73 insertions(+)

diff --git a/tools/testing/selftests/bpf/verifier/ref_tracking.c b/tools/testing/selftests/bpf/verifier/ref_tracking.c
index 3ed3593bd8b6..9695f8e9b58b 100644
--- a/tools/testing/selftests/bpf/verifier/ref_tracking.c
+++ b/tools/testing/selftests/bpf/verifier/ref_tracking.c
@@ -605,3 +605,76 @@
 	.prog_type = BPF_PROG_TYPE_SCHED_CLS,
 	.result = ACCEPT,
 },
+{
+	"reference tracking: use ptr from bpf_tcp_sock() after release",
+	.insns = {
+	BPF_SK_LOOKUP,
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_0, 0, 1),
+	BPF_EXIT_INSN(),
+	BPF_MOV64_REG(BPF_REG_6, BPF_REG_0),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_0),
+	BPF_EMIT_CALL(BPF_FUNC_tcp_sock),
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_0, 0, 3),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_6),
+	BPF_EMIT_CALL(BPF_FUNC_sk_release),
+	BPF_EXIT_INSN(),
+	BPF_MOV64_REG(BPF_REG_7, BPF_REG_0),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_6),
+	BPF_EMIT_CALL(BPF_FUNC_sk_release),
+	BPF_LDX_MEM(BPF_W, BPF_REG_0, BPF_REG_7, offsetof(struct bpf_tcp_sock, snd_cwnd)),
+	BPF_EXIT_INSN(),
+	},
+	.prog_type = BPF_PROG_TYPE_SCHED_CLS,
+	.result = REJECT,
+	.errstr = "invalid mem access",
+},
+{
+	"reference tracking: use ptr from bpf_sk_fullsock() after release",
+	.insns = {
+	BPF_SK_LOOKUP,
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_0, 0, 1),
+	BPF_EXIT_INSN(),
+	BPF_MOV64_REG(BPF_REG_6, BPF_REG_0),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_0),
+	BPF_EMIT_CALL(BPF_FUNC_sk_fullsock),
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_0, 0, 3),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_6),
+	BPF_EMIT_CALL(BPF_FUNC_sk_release),
+	BPF_EXIT_INSN(),
+	BPF_MOV64_REG(BPF_REG_7, BPF_REG_0),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_6),
+	BPF_EMIT_CALL(BPF_FUNC_sk_release),
+	BPF_LDX_MEM(BPF_W, BPF_REG_0, BPF_REG_7, offsetof(struct bpf_sock, type)),
+	BPF_EXIT_INSN(),
+	},
+	.prog_type = BPF_PROG_TYPE_SCHED_CLS,
+	.result = REJECT,
+	.errstr = "invalid mem access",
+},
+{
+	"reference tracking: use ptr from bpf_sk_fullsock(tp) after release",
+	.insns = {
+	BPF_SK_LOOKUP,
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_0, 0, 1),
+	BPF_EXIT_INSN(),
+	BPF_MOV64_REG(BPF_REG_6, BPF_REG_0),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_0),
+	BPF_EMIT_CALL(BPF_FUNC_tcp_sock),
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_0, 0, 3),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_6),
+	BPF_EMIT_CALL(BPF_FUNC_sk_release),
+	BPF_EXIT_INSN(),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_0),
+	BPF_EMIT_CALL(BPF_FUNC_sk_fullsock),
+	BPF_MOV64_REG(BPF_REG_1, BPF_REG_6),
+	BPF_MOV64_REG(BPF_REG_6, BPF_REG_0),
+	BPF_EMIT_CALL(BPF_FUNC_sk_release),
+	BPF_JMP_IMM(BPF_JNE, BPF_REG_6, 0, 1),
+	BPF_EXIT_INSN(),
+	BPF_LDX_MEM(BPF_W, BPF_REG_0, BPF_REG_6, offsetof(struct bpf_sock, type)),
+	BPF_EXIT_INSN(),
+	},
+	.prog_type = BPF_PROG_TYPE_SCHED_CLS,
+	.result = REJECT,
+	.errstr = "invalid mem access",
+},
-- 
2.17.1


^ permalink raw reply related	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-02 16:10 ` [PATCH v3 bpf-next 1/2] bpf: " Martin KaFai Lau
@ 2019-03-02 18:03   ` Alexei Starovoitov
  2019-03-02 20:21     ` Martin Lau
  0 siblings, 1 reply; 10+ messages in thread
From: Alexei Starovoitov @ 2019-03-02 18:03 UTC (permalink / raw)
  To: Martin KaFai Lau
  Cc: netdev, Alexei Starovoitov, Daniel Borkmann, kernel-team, Lorenz Bauer

On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
> Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
> can still be accessed after bpf_sk_release(sk).
> Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
> This patch addresses them together.
> 
> A simple reproducer looks like this:
> 
> sk = bpf_sk_lookup_tcp();
> /* if (!sk) ... */
> tp = bpf_tcp_sock(sk);
> /* if (!tp) ... */
> bpf_sk_release(sk);
> snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
> 
> The problem is the verifier did not scrub the register's states of
> the tcp_sock ptr (tp) after bpf_sk_release(sk).
> 
> [ Note that when calling bpf_tcp_sock(sk), the sk is not always
>   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
>   fine for this case. ]
> 
> Currently, the verifier does not track if a helper's return ptr (in REG_0)
> is "carry"-ing one of its argument's refcount status. To carry this info,
> the reg1->id needs to be stored in reg0.  The reg0->id has already
> been used for NULL checking purpose.  Hence, a new "refcount_id"
> is needed in "struct bpf_reg_state".
> 
> With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
> all reg states which has a refcount_id match.  It is done with the changes
> in release_reg_references().
> 
> When acquiring and releasing a refcount, the reg->id is still used.
> Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
> example.

I think the choice of returning listener full sock from req sock
in sk_to_full_sk() was a wrong one.
It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as 
always type cast or null.
And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.

Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
The verifier will know that it's the case because its ID will be in acquired_refs.

The additional refcount_id won't be necessary.
bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
and the verifier will be copying reg1->id into reg0->id.

In release_reference() the verifier will do
  if (regs[i].id == id)
    mark_reg_unknown(env, regs, i);
for all socket types.

release_reference_state() will stay as-is.

imo such logic will be easier to follow.

This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
and for the bpf program writers.

The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
since req socket will not be returned from bpf_sk_lookup_tcp and its ID
will not be stored in acuired_refs.

Does it make sense ?

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-02 18:03   ` Alexei Starovoitov
@ 2019-03-02 20:21     ` Martin Lau
  2019-03-04  9:33       ` Daniel Borkmann
  0 siblings, 1 reply; 10+ messages in thread
From: Martin Lau @ 2019-03-02 20:21 UTC (permalink / raw)
  To: Alexei Starovoitov
  Cc: netdev, Alexei Starovoitov, Daniel Borkmann, Kernel Team, Lorenz Bauer

On Sat, Mar 02, 2019 at 10:03:03AM -0800, Alexei Starovoitov wrote:
> On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
> > Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
> > can still be accessed after bpf_sk_release(sk).
> > Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
> > This patch addresses them together.
> > 
> > A simple reproducer looks like this:
> > 
> > sk = bpf_sk_lookup_tcp();
> > /* if (!sk) ... */
> > tp = bpf_tcp_sock(sk);
> > /* if (!tp) ... */
> > bpf_sk_release(sk);
> > snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
> > 
> > The problem is the verifier did not scrub the register's states of
> > the tcp_sock ptr (tp) after bpf_sk_release(sk).
> > 
> > [ Note that when calling bpf_tcp_sock(sk), the sk is not always
> >   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
> >   fine for this case. ]
> > 
> > Currently, the verifier does not track if a helper's return ptr (in REG_0)
> > is "carry"-ing one of its argument's refcount status. To carry this info,
> > the reg1->id needs to be stored in reg0.  The reg0->id has already
> > been used for NULL checking purpose.  Hence, a new "refcount_id"
> > is needed in "struct bpf_reg_state".
> > 
> > With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
> > all reg states which has a refcount_id match.  It is done with the changes
> > in release_reg_references().
> > 
> > When acquiring and releasing a refcount, the reg->id is still used.
> > Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
> > example.
> 
> I think the choice of returning listener full sock from req sock
> in sk_to_full_sk() was a wrong one.
> It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as 
> always type cast or null.
> And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.
> 
> Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
> The verifier will know that it's the case because its ID will be in acquired_refs.
> 
> The additional refcount_id won't be necessary.
> bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
> and the verifier will be copying reg1->id into reg0->id.
> 
> In release_reference() the verifier will do
>   if (regs[i].id == id)
>     mark_reg_unknown(env, regs, i);
> for all socket types.
> 
> release_reference_state() will stay as-is.
> 
> imo such logic will be easier to follow.
> 
> This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
> and for the bpf program writers.
> 
> The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
> since req socket will not be returned from bpf_sk_lookup_tcp and its ID
> will not be stored in acuired_refs.
> 
> Does it make sense ?
I like this idea.  Many thanks for thinking it through!

Allowing bpf_sk_release(tp), no need to call bpf_sk_release() on ptr
returned from bpf_get_listener_sock(sk) and keep one reg->id.

I think it should work.  I will rework the patches.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-02 20:21     ` Martin Lau
@ 2019-03-04  9:33       ` Daniel Borkmann
  2019-03-04 17:42         ` Martin Lau
  0 siblings, 1 reply; 10+ messages in thread
From: Daniel Borkmann @ 2019-03-04  9:33 UTC (permalink / raw)
  To: Martin Lau, Alexei Starovoitov
  Cc: netdev, Alexei Starovoitov, Kernel Team, Lorenz Bauer

On 03/02/2019 09:21 PM, Martin Lau wrote:
> On Sat, Mar 02, 2019 at 10:03:03AM -0800, Alexei Starovoitov wrote:
>> On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
>>> Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
>>> can still be accessed after bpf_sk_release(sk).
>>> Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
>>> This patch addresses them together.
>>>
>>> A simple reproducer looks like this:
>>>
>>> sk = bpf_sk_lookup_tcp();
>>> /* if (!sk) ... */
>>> tp = bpf_tcp_sock(sk);
>>> /* if (!tp) ... */
>>> bpf_sk_release(sk);
>>> snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
>>>
>>> The problem is the verifier did not scrub the register's states of
>>> the tcp_sock ptr (tp) after bpf_sk_release(sk).
>>>
>>> [ Note that when calling bpf_tcp_sock(sk), the sk is not always
>>>   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
>>>   fine for this case. ]
>>>
>>> Currently, the verifier does not track if a helper's return ptr (in REG_0)
>>> is "carry"-ing one of its argument's refcount status. To carry this info,
>>> the reg1->id needs to be stored in reg0.  The reg0->id has already
>>> been used for NULL checking purpose.  Hence, a new "refcount_id"
>>> is needed in "struct bpf_reg_state".
>>>
>>> With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
>>> all reg states which has a refcount_id match.  It is done with the changes
>>> in release_reg_references().
>>>
>>> When acquiring and releasing a refcount, the reg->id is still used.
>>> Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
>>> example.
>>
>> I think the choice of returning listener full sock from req sock
>> in sk_to_full_sk() was a wrong one.
>> It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as 
>> always type cast or null.
>> And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.
>>
>> Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
>> The verifier will know that it's the case because its ID will be in acquired_refs.
>>
>> The additional refcount_id won't be necessary.
>> bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
>> and the verifier will be copying reg1->id into reg0->id.
>>
>> In release_reference() the verifier will do
>>   if (regs[i].id == id)
>>     mark_reg_unknown(env, regs, i);
>> for all socket types.
>>
>> release_reference_state() will stay as-is.
>>
>> imo such logic will be easier to follow.
>>
>> This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
>> and for the bpf program writers.
>>
>> The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
>> since req socket will not be returned from bpf_sk_lookup_tcp and its ID
>> will not be stored in acuired_refs.
>>
>> Does it make sense ?
> I like this idea.  Many thanks for thinking it through!
> 
> Allowing bpf_sk_release(tp), no need to call bpf_sk_release() on ptr
> returned from bpf_get_listener_sock(sk) and keep one reg->id.
> 
> I think it should work.  I will rework the patches.

Agree, makes sense, that seems much better fix.

Thanks,
Daniel

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-04  9:33       ` Daniel Borkmann
@ 2019-03-04 17:42         ` Martin Lau
  2019-03-06 15:59           ` Lorenz Bauer
  0 siblings, 1 reply; 10+ messages in thread
From: Martin Lau @ 2019-03-04 17:42 UTC (permalink / raw)
  To: Daniel Borkmann
  Cc: Alexei Starovoitov, netdev, Alexei Starovoitov, Kernel Team,
	Lorenz Bauer

On Mon, Mar 04, 2019 at 10:33:46AM +0100, Daniel Borkmann wrote:
> On 03/02/2019 09:21 PM, Martin Lau wrote:
> > On Sat, Mar 02, 2019 at 10:03:03AM -0800, Alexei Starovoitov wrote:
> >> On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
> >>> Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
> >>> can still be accessed after bpf_sk_release(sk).
> >>> Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
> >>> This patch addresses them together.
> >>>
> >>> A simple reproducer looks like this:
> >>>
> >>> sk = bpf_sk_lookup_tcp();
> >>> /* if (!sk) ... */
> >>> tp = bpf_tcp_sock(sk);
> >>> /* if (!tp) ... */
> >>> bpf_sk_release(sk);
> >>> snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
> >>>
> >>> The problem is the verifier did not scrub the register's states of
> >>> the tcp_sock ptr (tp) after bpf_sk_release(sk).
> >>>
> >>> [ Note that when calling bpf_tcp_sock(sk), the sk is not always
> >>>   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
> >>>   fine for this case. ]
> >>>
> >>> Currently, the verifier does not track if a helper's return ptr (in REG_0)
> >>> is "carry"-ing one of its argument's refcount status. To carry this info,
> >>> the reg1->id needs to be stored in reg0.  The reg0->id has already
> >>> been used for NULL checking purpose.  Hence, a new "refcount_id"
> >>> is needed in "struct bpf_reg_state".
> >>>
> >>> With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
> >>> all reg states which has a refcount_id match.  It is done with the changes
> >>> in release_reg_references().
> >>>
> >>> When acquiring and releasing a refcount, the reg->id is still used.
> >>> Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
> >>> example.
> >>
> >> I think the choice of returning listener full sock from req sock
> >> in sk_to_full_sk() was a wrong one.
> >> It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as 
> >> always type cast or null.
> >> And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.
> >>
> >> Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
> >> The verifier will know that it's the case because its ID will be in acquired_refs.
> >>
> >> The additional refcount_id won't be necessary.
> >> bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
> >> and the verifier will be copying reg1->id into reg0->id.
> >>
> >> In release_reference() the verifier will do
> >>   if (regs[i].id == id)
> >>     mark_reg_unknown(env, regs, i);
> >> for all socket types.
> >>
> >> release_reference_state() will stay as-is.
> >>
> >> imo such logic will be easier to follow.
> >>
> >> This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
> >> and for the bpf program writers.
> >>
> >> The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
> >> since req socket will not be returned from bpf_sk_lookup_tcp and its ID
> >> will not be stored in acuired_refs.
> >>
> >> Does it make sense ?
> > I like this idea.  Many thanks for thinking it through!
> > 
> > Allowing bpf_sk_release(tp), no need to call bpf_sk_release() on ptr
> > returned from bpf_get_listener_sock(sk) and keep one reg->id.
> > 
> > I think it should work.  I will rework the patches.
> 
> Agree, makes sense, that seems much better fix.
While I was working on this change, based on the code, one issue I saw is:

if the bpf prog does this:

sk = bpf_sk_lookup_tcp();
/* if (!sk) ... */
fullsock = bpf_sk_fullsock(sk);
if (!fullsock) {
	bpf_sk_release(sk); /* Fail. sk_reg->id not found in ref state */
	return 0;
}

The bpf_sk_release(sk) failed because the reference state has already
been released by "release_reference_state(state, fullsock_reg->id)" during
"if (!fullsock) /* handled by mark_ptr_or_null_regs(is_null == true) */"
Logically, I think bpf_sk_release(sk) should not fail regardless of
bpf_sk_fullsock() doing sk_to_full_sk() or not.

bpf_sk_fullsock() could disallow PTR_TO_SOCKET or PTR_TO_TCP_SOCK but that
would be weird.

I think we still need two id.  May be rename the refcount_id proposed in
this patch to ref_obj_id which is the original refcounted object id.

If the sk_to_full_sk() is removed from bpf_sk_fullsock() and bpf_tcp_sock(),
these two helpers become a simple cast (i.e. either return the same pointer
or NULL).  Then bpf_sk_release(fullsock) and bpf_sk_release(tp) could work:

- When is_null == true, release_reference_state(state, reg->id) is called.
- During bpf_sk_release(fullsock), release_reference(env, reg->ref_obj_id)
  is called so that sk, fullsock and tp with the same ref_obj_id will
  be mark_reg_unknown().

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-04 17:42         ` Martin Lau
@ 2019-03-06 15:59           ` Lorenz Bauer
  2019-03-09  5:28             ` Martin Lau
  0 siblings, 1 reply; 10+ messages in thread
From: Lorenz Bauer @ 2019-03-06 15:59 UTC (permalink / raw)
  To: Martin Lau
  Cc: Daniel Borkmann, Alexei Starovoitov, netdev, Alexei Starovoitov,
	Kernel Team

On Mon, 4 Mar 2019 at 17:43, Martin Lau <kafai@fb.com> wrote:
>
> On Mon, Mar 04, 2019 at 10:33:46AM +0100, Daniel Borkmann wrote:
> > On 03/02/2019 09:21 PM, Martin Lau wrote:
> > > On Sat, Mar 02, 2019 at 10:03:03AM -0800, Alexei Starovoitov wrote:
> > >> On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
> > >>> Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
> > >>> can still be accessed after bpf_sk_release(sk).
> > >>> Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
> > >>> This patch addresses them together.
> > >>>
> > >>> A simple reproducer looks like this:
> > >>>
> > >>> sk = bpf_sk_lookup_tcp();
> > >>> /* if (!sk) ... */
> > >>> tp = bpf_tcp_sock(sk);
> > >>> /* if (!tp) ... */
> > >>> bpf_sk_release(sk);
> > >>> snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
> > >>>
> > >>> The problem is the verifier did not scrub the register's states of
> > >>> the tcp_sock ptr (tp) after bpf_sk_release(sk).
> > >>>
> > >>> [ Note that when calling bpf_tcp_sock(sk), the sk is not always
> > >>>   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
> > >>>   fine for this case. ]
> > >>>
> > >>> Currently, the verifier does not track if a helper's return ptr (in REG_0)
> > >>> is "carry"-ing one of its argument's refcount status. To carry this info,
> > >>> the reg1->id needs to be stored in reg0.  The reg0->id has already
> > >>> been used for NULL checking purpose.  Hence, a new "refcount_id"
> > >>> is needed in "struct bpf_reg_state".
> > >>>
> > >>> With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
> > >>> all reg states which has a refcount_id match.  It is done with the changes
> > >>> in release_reg_references().
> > >>>
> > >>> When acquiring and releasing a refcount, the reg->id is still used.
> > >>> Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
> > >>> example.
> > >>
> > >> I think the choice of returning listener full sock from req sock
> > >> in sk_to_full_sk() was a wrong one.
> > >> It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as
> > >> always type cast or null.
> > >> And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.
> > >>
> > >> Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
> > >> The verifier will know that it's the case because its ID will be in acquired_refs.
> > >>
> > >> The additional refcount_id won't be necessary.
> > >> bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
> > >> and the verifier will be copying reg1->id into reg0->id.
> > >>
> > >> In release_reference() the verifier will do
> > >>   if (regs[i].id == id)
> > >>     mark_reg_unknown(env, regs, i);
> > >> for all socket types.
> > >>
> > >> release_reference_state() will stay as-is.
> > >>
> > >> imo such logic will be easier to follow.
> > >>
> > >> This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
> > >> and for the bpf program writers.
> > >>
> > >> The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
> > >> since req socket will not be returned from bpf_sk_lookup_tcp and its ID
> > >> will not be stored in acuired_refs.
> > >>
> > >> Does it make sense ?
> > > I like this idea.  Many thanks for thinking it through!
> > >
> > > Allowing bpf_sk_release(tp), no need to call bpf_sk_release() on ptr
> > > returned from bpf_get_listener_sock(sk) and keep one reg->id.
> > >
> > > I think it should work.  I will rework the patches.
> >
> > Agree, makes sense, that seems much better fix.
> While I was working on this change, based on the code, one issue I saw is:
>
> if the bpf prog does this:
>
> sk = bpf_sk_lookup_tcp();
> /* if (!sk) ... */
> fullsock = bpf_sk_fullsock(sk);
> if (!fullsock) {
>         bpf_sk_release(sk); /* Fail. sk_reg->id not found in ref state */
>         return 0;
> }
>
> The bpf_sk_release(sk) failed because the reference state has already
> been released by "release_reference_state(state, fullsock_reg->id)" during
> "if (!fullsock) /* handled by mark_ptr_or_null_regs(is_null == true) */"
> Logically, I think bpf_sk_release(sk) should not fail regardless of
> bpf_sk_fullsock() doing sk_to_full_sk() or not.
>
> bpf_sk_fullsock() could disallow PTR_TO_SOCKET or PTR_TO_TCP_SOCK but that
> would be weird.
>
> I think we still need two id.  May be rename the refcount_id proposed in
> this patch to ref_obj_id which is the original refcounted object id.
>
> If the sk_to_full_sk() is removed from bpf_sk_fullsock() and bpf_tcp_sock(),
> these two helpers become a simple cast (i.e. either return the same pointer
> or NULL).  Then bpf_sk_release(fullsock) and bpf_sk_release(tp) could work:
>
> - When is_null == true, release_reference_state(state, reg->id) is called.

If I understand correctly, this works because we never
acquire_reference() for tp/ fullsock,
making this a no-op?

> - During bpf_sk_release(fullsock), release_reference(env, reg->ref_obj_id)
>   is called so that sk, fullsock and tp with the same ref_obj_id will
>   be mark_reg_unknown().

To clarify, the following states are possible:
* id == 0, ref_obj_id == 0: not a pointer / reference
* id != 0, reg_obj_id == 0: a reference which didn't have
acquire_reference() called
* id != 0, reg_obj_id != 0: a reference which had acquire_reference() called
* id == 0, reg_obj_id: illegal

Sounds reasonable to me.

-- 
Lorenz Bauer  |  Systems Engineer
25 Lavington St., London SE1 0NZ

www.cloudflare.com

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-06 15:59           ` Lorenz Bauer
@ 2019-03-09  5:28             ` Martin Lau
  2019-03-11 11:47               ` Lorenz Bauer
  0 siblings, 1 reply; 10+ messages in thread
From: Martin Lau @ 2019-03-09  5:28 UTC (permalink / raw)
  To: Lorenz Bauer
  Cc: Daniel Borkmann, Alexei Starovoitov, netdev, Alexei Starovoitov,
	Kernel Team

On Wed, Mar 06, 2019 at 03:59:40PM +0000, Lorenz Bauer wrote:
> On Mon, 4 Mar 2019 at 17:43, Martin Lau <kafai@fb.com> wrote:
> >
> > On Mon, Mar 04, 2019 at 10:33:46AM +0100, Daniel Borkmann wrote:
> > > On 03/02/2019 09:21 PM, Martin Lau wrote:
> > > > On Sat, Mar 02, 2019 at 10:03:03AM -0800, Alexei Starovoitov wrote:
> > > >> On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
> > > >>> Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
> > > >>> can still be accessed after bpf_sk_release(sk).
> > > >>> Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
> > > >>> This patch addresses them together.
> > > >>>
> > > >>> A simple reproducer looks like this:
> > > >>>
> > > >>> sk = bpf_sk_lookup_tcp();
> > > >>> /* if (!sk) ... */
> > > >>> tp = bpf_tcp_sock(sk);
> > > >>> /* if (!tp) ... */
> > > >>> bpf_sk_release(sk);
> > > >>> snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
> > > >>>
> > > >>> The problem is the verifier did not scrub the register's states of
> > > >>> the tcp_sock ptr (tp) after bpf_sk_release(sk).
> > > >>>
> > > >>> [ Note that when calling bpf_tcp_sock(sk), the sk is not always
> > > >>>   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
> > > >>>   fine for this case. ]
> > > >>>
> > > >>> Currently, the verifier does not track if a helper's return ptr (in REG_0)
> > > >>> is "carry"-ing one of its argument's refcount status. To carry this info,
> > > >>> the reg1->id needs to be stored in reg0.  The reg0->id has already
> > > >>> been used for NULL checking purpose.  Hence, a new "refcount_id"
> > > >>> is needed in "struct bpf_reg_state".
> > > >>>
> > > >>> With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
> > > >>> all reg states which has a refcount_id match.  It is done with the changes
> > > >>> in release_reg_references().
> > > >>>
> > > >>> When acquiring and releasing a refcount, the reg->id is still used.
> > > >>> Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
> > > >>> example.
> > > >>
> > > >> I think the choice of returning listener full sock from req sock
> > > >> in sk_to_full_sk() was a wrong one.
> > > >> It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as
> > > >> always type cast or null.
> > > >> And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.
> > > >>
> > > >> Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
> > > >> The verifier will know that it's the case because its ID will be in acquired_refs.
> > > >>
> > > >> The additional refcount_id won't be necessary.
> > > >> bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
> > > >> and the verifier will be copying reg1->id into reg0->id.
> > > >>
> > > >> In release_reference() the verifier will do
> > > >>   if (regs[i].id == id)
> > > >>     mark_reg_unknown(env, regs, i);
> > > >> for all socket types.
> > > >>
> > > >> release_reference_state() will stay as-is.
> > > >>
> > > >> imo such logic will be easier to follow.
> > > >>
> > > >> This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
> > > >> and for the bpf program writers.
> > > >>
> > > >> The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
> > > >> since req socket will not be returned from bpf_sk_lookup_tcp and its ID
> > > >> will not be stored in acuired_refs.
> > > >>
> > > >> Does it make sense ?
> > > > I like this idea.  Many thanks for thinking it through!
> > > >
> > > > Allowing bpf_sk_release(tp), no need to call bpf_sk_release() on ptr
> > > > returned from bpf_get_listener_sock(sk) and keep one reg->id.
> > > >
> > > > I think it should work.  I will rework the patches.
> > >
> > > Agree, makes sense, that seems much better fix.
> > While I was working on this change, based on the code, one issue I saw is:
> >
> > if the bpf prog does this:
> >
> > sk = bpf_sk_lookup_tcp();
> > /* if (!sk) ... */
> > fullsock = bpf_sk_fullsock(sk);
> > if (!fullsock) {
> >         bpf_sk_release(sk); /* Fail. sk_reg->id not found in ref state */
> >         return 0;
> > }
> >
> > The bpf_sk_release(sk) failed because the reference state has already
> > been released by "release_reference_state(state, fullsock_reg->id)" during
> > "if (!fullsock) /* handled by mark_ptr_or_null_regs(is_null == true) */"
> > Logically, I think bpf_sk_release(sk) should not fail regardless of
> > bpf_sk_fullsock() doing sk_to_full_sk() or not.
> >
> > bpf_sk_fullsock() could disallow PTR_TO_SOCKET or PTR_TO_TCP_SOCK but that
> > would be weird.
> >
> > I think we still need two id.  May be rename the refcount_id proposed in
> > this patch to ref_obj_id which is the original refcounted object id.
> >
> > If the sk_to_full_sk() is removed from bpf_sk_fullsock() and bpf_tcp_sock(),
> > these two helpers become a simple cast (i.e. either return the same pointer
> > or NULL).  Then bpf_sk_release(fullsock) and bpf_sk_release(tp) could work:
> >
> > - When is_null == true, release_reference_state(state, reg->id) is called.
> 
> If I understand correctly, this works because we never
> acquire_reference() for tp/ fullsock,
> making this a no-op?
Sorry for the late reply.

Correct. Those two helpers do not take ref, so
release_reference_state() will not be called.

> 
> > - During bpf_sk_release(fullsock), release_reference(env, reg->ref_obj_id)
> >   is called so that sk, fullsock and tp with the same ref_obj_id will
> >   be mark_reg_unknown().
> 
> To clarify, the following states are possible:
> * id == 0, ref_obj_id == 0: not a pointer / reference
> * id != 0, reg_obj_id == 0: a reference which didn't have
> acquire_reference() called
> * id != 0, reg_obj_id != 0: a reference which had acquire_reference() called
> * id == 0, reg_obj_id: illegal
In this 2 id(s) approach, I would think of it in this way.
id and ref_obj_id are for two different purposes.  One for
null checking and one for reference tracking.  Whenever
its own purpose is served, it can be set to 0.

Regardless, I am working on another idea that does not
require two id(s) in bpf_reg_state.  I will give
an update on this.

^ permalink raw reply	[flat|nested] 10+ messages in thread

* Re: [PATCH v3 bpf-next 1/2] bpf: Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release
  2019-03-09  5:28             ` Martin Lau
@ 2019-03-11 11:47               ` Lorenz Bauer
  0 siblings, 0 replies; 10+ messages in thread
From: Lorenz Bauer @ 2019-03-11 11:47 UTC (permalink / raw)
  To: Martin Lau
  Cc: Daniel Borkmann, Alexei Starovoitov, netdev, Alexei Starovoitov,
	Kernel Team

On Sat, 9 Mar 2019 at 05:28, Martin Lau <kafai@fb.com> wrote:
>
> On Wed, Mar 06, 2019 at 03:59:40PM +0000, Lorenz Bauer wrote:
> > On Mon, 4 Mar 2019 at 17:43, Martin Lau <kafai@fb.com> wrote:
> > >
> > > On Mon, Mar 04, 2019 at 10:33:46AM +0100, Daniel Borkmann wrote:
> > > > On 03/02/2019 09:21 PM, Martin Lau wrote:
> > > > > On Sat, Mar 02, 2019 at 10:03:03AM -0800, Alexei Starovoitov wrote:
> > > > >> On Sat, Mar 02, 2019 at 08:10:10AM -0800, Martin KaFai Lau wrote:
> > > > >>> Lorenz Bauer [thanks!] reported that a ptr returned by bpf_tcp_sock(sk)
> > > > >>> can still be accessed after bpf_sk_release(sk).
> > > > >>> Both bpf_tcp_sock() and bpf_sk_fullsock() have the same issue.
> > > > >>> This patch addresses them together.
> > > > >>>
> > > > >>> A simple reproducer looks like this:
> > > > >>>
> > > > >>> sk = bpf_sk_lookup_tcp();
> > > > >>> /* if (!sk) ... */
> > > > >>> tp = bpf_tcp_sock(sk);
> > > > >>> /* if (!tp) ... */
> > > > >>> bpf_sk_release(sk);
> > > > >>> snd_cwnd = tp->snd_cwnd; /* oops! The verifier does not complain. */
> > > > >>>
> > > > >>> The problem is the verifier did not scrub the register's states of
> > > > >>> the tcp_sock ptr (tp) after bpf_sk_release(sk).
> > > > >>>
> > > > >>> [ Note that when calling bpf_tcp_sock(sk), the sk is not always
> > > > >>>   refcount-acquired. e.g. bpf_tcp_sock(skb->sk). The verifier works
> > > > >>>   fine for this case. ]
> > > > >>>
> > > > >>> Currently, the verifier does not track if a helper's return ptr (in REG_0)
> > > > >>> is "carry"-ing one of its argument's refcount status. To carry this info,
> > > > >>> the reg1->id needs to be stored in reg0.  The reg0->id has already
> > > > >>> been used for NULL checking purpose.  Hence, a new "refcount_id"
> > > > >>> is needed in "struct bpf_reg_state".
> > > > >>>
> > > > >>> With refcount_id, when bpf_sk_release(sk) is called, the verifier can scrub
> > > > >>> all reg states which has a refcount_id match.  It is done with the changes
> > > > >>> in release_reg_references().
> > > > >>>
> > > > >>> When acquiring and releasing a refcount, the reg->id is still used.
> > > > >>> Hence, we cannot do "bpf_sk_release(tp)" in the above reproducer
> > > > >>> example.
> > > > >>
> > > > >> I think the choice of returning listener full sock from req sock
> > > > >> in sk_to_full_sk() was a wrong one.
> > > > >> It seems better to make semantics of bpf_tcp_sock() and bpf_sk_fullsock() as
> > > > >> always type cast or null.
> > > > >> And have a separate helper for req socket that returns inet_reqsk(sk)->rsk_listener.
> > > > >>
> > > > >> Then it will be ok to call bpf_sk_release(tp) when tp came from bpf_sk_lookup_tcp.
> > > > >> The verifier will know that it's the case because its ID will be in acquired_refs.
> > > > >>
> > > > >> The additional refcount_id won't be necessary.
> > > > >> bpf_sk_fullsock() and bpf_tcp_sock() will not call sk_to_full_sk
> > > > >> and the verifier will be copying reg1->id into reg0->id.
> > > > >>
> > > > >> In release_reference() the verifier will do
> > > > >>   if (regs[i].id == id)
> > > > >>     mark_reg_unknown(env, regs, i);
> > > > >> for all socket types.
> > > > >>
> > > > >> release_reference_state() will stay as-is.
> > > > >>
> > > > >> imo such logic will be easier to follow.
> > > > >>
> > > > >> This implicit sk_to_full_sk() makes the whole thing much harder for the verifier
> > > > >> and for the bpf program writers.
> > > > >>
> > > > >> The new bpf_get_listener_sock(sk) doesn't have to copy ID from reg1 to reg0
> > > > >> since req socket will not be returned from bpf_sk_lookup_tcp and its ID
> > > > >> will not be stored in acuired_refs.
> > > > >>
> > > > >> Does it make sense ?
> > > > > I like this idea.  Many thanks for thinking it through!
> > > > >
> > > > > Allowing bpf_sk_release(tp), no need to call bpf_sk_release() on ptr
> > > > > returned from bpf_get_listener_sock(sk) and keep one reg->id.
> > > > >
> > > > > I think it should work.  I will rework the patches.
> > > >
> > > > Agree, makes sense, that seems much better fix.
> > > While I was working on this change, based on the code, one issue I saw is:
> > >
> > > if the bpf prog does this:
> > >
> > > sk = bpf_sk_lookup_tcp();
> > > /* if (!sk) ... */
> > > fullsock = bpf_sk_fullsock(sk);
> > > if (!fullsock) {
> > >         bpf_sk_release(sk); /* Fail. sk_reg->id not found in ref state */
> > >         return 0;
> > > }
> > >
> > > The bpf_sk_release(sk) failed because the reference state has already
> > > been released by "release_reference_state(state, fullsock_reg->id)" during
> > > "if (!fullsock) /* handled by mark_ptr_or_null_regs(is_null == true) */"
> > > Logically, I think bpf_sk_release(sk) should not fail regardless of
> > > bpf_sk_fullsock() doing sk_to_full_sk() or not.
> > >
> > > bpf_sk_fullsock() could disallow PTR_TO_SOCKET or PTR_TO_TCP_SOCK but that
> > > would be weird.
> > >
> > > I think we still need two id.  May be rename the refcount_id proposed in
> > > this patch to ref_obj_id which is the original refcounted object id.
> > >
> > > If the sk_to_full_sk() is removed from bpf_sk_fullsock() and bpf_tcp_sock(),
> > > these two helpers become a simple cast (i.e. either return the same pointer
> > > or NULL).  Then bpf_sk_release(fullsock) and bpf_sk_release(tp) could work:
> > >
> > > - When is_null == true, release_reference_state(state, reg->id) is called.
> >
> > If I understand correctly, this works because we never
> > acquire_reference() for tp/ fullsock,
> > making this a no-op?
> Sorry for the late reply.
>
> Correct. Those two helpers do not take ref, so
> release_reference_state() will not be called.
>
> >
> > > - During bpf_sk_release(fullsock), release_reference(env, reg->ref_obj_id)
> > >   is called so that sk, fullsock and tp with the same ref_obj_id will
> > >   be mark_reg_unknown().
> >
> > To clarify, the following states are possible:
> > * id == 0, ref_obj_id == 0: not a pointer / reference
> > * id != 0, reg_obj_id == 0: a reference which didn't have
> > acquire_reference() called
> > * id != 0, reg_obj_id != 0: a reference which had acquire_reference() called
> > * id == 0, reg_obj_id: illegal
> In this 2 id(s) approach, I would think of it in this way.
> id and ref_obj_id are for two different purposes.  One for
> null checking and one for reference tracking.  Whenever
> its own purpose is served, it can be set to 0.
>
> Regardless, I am working on another idea that does not
> require two id(s) in bpf_reg_state.  I will give
> an update on this.

Ok, I'm holding off re-submitting the syn cookie helper until you've
landed your fix, otherwise
I run into problems when making sk_release accept PTR_TO_SOCK_COMMON.

-- 
Lorenz Bauer  |  Systems Engineer
25 Lavington St., London SE1 0NZ

www.cloudflare.com

^ permalink raw reply	[flat|nested] 10+ messages in thread

end of thread, other threads:[~2019-03-11 11:48 UTC | newest]

Thread overview: 10+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2019-03-02 16:10 [PATCH v3 bpf-next 0/2] Fix bpf_tcp_sock and bpf_sk_fullsock issue related to bpf_sk_release Martin KaFai Lau
2019-03-02 16:10 ` [PATCH v3 bpf-next 1/2] bpf: " Martin KaFai Lau
2019-03-02 18:03   ` Alexei Starovoitov
2019-03-02 20:21     ` Martin Lau
2019-03-04  9:33       ` Daniel Borkmann
2019-03-04 17:42         ` Martin Lau
2019-03-06 15:59           ` Lorenz Bauer
2019-03-09  5:28             ` Martin Lau
2019-03-11 11:47               ` Lorenz Bauer
2019-03-02 16:10 ` [PATCH v3 bpf-next 2/2] bpf: Test ref release issue in bpf_tcp_sock and bpf_sk_fullsock Martin KaFai Lau

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.