netfilter-devel.vger.kernel.org archive mirror
* Re: data-race in nf_tables_newtable / nf_tables_newtable
       [not found] <CAEHB2488dNqBKcgWLSeq500JLC1+q6RV=ENcUPm=rN9bWf0QkQ@mail.gmail.com>
@ 2022-08-19 12:35 ` Florian Westphal
  2022-08-22 20:29   ` Gabriel Ryan
  0 siblings, 1 reply; 4+ messages in thread
From: Florian Westphal @ 2022-08-19 12:35 UTC
  To: Abhishek Shah
  Cc: coreteam, davem, edumazet, fw, kadlec, kuba, netdev,
	netfilter-devel, pabeni, pablo, Gabriel Ryan, linux-kernel

Abhishek Shah <abhishek.shah@columbia.edu> wrote:
> Hi all,
> 
> We found a race involving the table->handle variable here
> <https://elixir.bootlin.com/linux/v5.18-rc5/source/net/netfilter/nf_tables_api.c#L1221>.
> This race advances the pointer, which can cause out-of-bounds memory
> accesses in the future. Please let us know what you think.
> 
> Thanks!
> 
> 
> *---------------------Report-----------------*
> *read-write* to 0xffffffff883a01e8 of 8 bytes by task 6542 on cpu 0:
>  nf_tables_newtable+0x6dc/0xc00 net/netfilter/nf_tables_api.c:1221
>  nfnetlink_rcv_batch net/netfilter/nfnetlink.c:513 [inline]

[..]

> *read-write* to 0xffffffff883a01e8 of 8 bytes by task 6541 on cpu 1:
>  nf_tables_newtable+0x6dc/0xc00 net/netfilter/nf_tables_api.c:1221
>  nfnetlink_rcv_batch net/netfilter/nfnetlink.c:513 [inline]

[..]

I don't understand.  Like all batch operations, nf_tables_newtable is
supposed to run with the transaction mutex held, i.e. parallel execution
is not expected.

There is a lockdep assertion at the start of nf_tables_newtable(); I
don't see how it's possible for two threads to run this concurrently.


* Re: data-race in nf_tables_newtable / nf_tables_newtable
  2022-08-19 12:35 ` data-race in nf_tables_newtable / nf_tables_newtable Florian Westphal
@ 2022-08-22 20:29   ` Gabriel Ryan
  2022-08-22 20:41     ` Florian Westphal
  0 siblings, 1 reply; 4+ messages in thread
From: Gabriel Ryan @ 2022-08-22 20:29 UTC
  To: Florian Westphal
  Cc: Abhishek Shah, coreteam, davem, edumazet, kadlec, kuba, netdev,
	netfilter-devel, pabeni, pablo, linux-kernel

Hi Florian,

I just looked at the lock event trace from our report and it looks
like two distinct commit mutexes were held when the race was
triggered. I think the race is probably on the table_handle variable
on net/netfilter/nf_tables_api.c:1221, and not the table->handle field
being written to.

Racing increments to table_handle could cause it to either overcount
or undercount. Could that be an issue?
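
For illustration only, here is a minimal userspace sketch of how an
unsynchronized increment of a shared counter can lose an update. It is
not the kernel code: handle_load(), handle_store() and
replay_lost_update() are hypothetical helpers, and the increment is
split into its load and store halves so that one bad interleaving can
be replayed deterministically in a single thread.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a counter shared by all network namespaces. */
static uint64_t table_handle;

/* First half of "++table_handle": read the current value. */
static uint64_t handle_load(void)
{
	return table_handle;
}

/* Second half: write back (value seen + 1) and return the new handle. */
static uint64_t handle_store(uint64_t seen)
{
	table_handle = seen + 1;
	return table_handle;
}

/*
 * Replay one interleaving of two tasks, A and B, that each hold a
 * different mutex and therefore do not exclude each other.
 * Returns 1 if both tasks ended up with the same handle.
 */
static int replay_lost_update(uint64_t *a, uint64_t *b)
{
	uint64_t seen_a, seen_b;

	assert(a && b);
	table_handle = 0;

	seen_a = handle_load();    /* task A reads 0 */
	seen_b = handle_load();    /* task B reads 0 before A writes back */
	*a = handle_store(seen_a); /* task A stores 1 */
	*b = handle_store(seen_b); /* task B also stores 1: A's update is lost */

	return *a == *b;
}
```

In this replay two handles are handed out but the counter advances only
once, so both callers receive the value 1.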

Best,

Gabe


-- 
Gabriel Ryan
PhD Candidate at Columbia University


* Re: data-race in nf_tables_newtable / nf_tables_newtable
  2022-08-22 20:29   ` Gabriel Ryan
@ 2022-08-22 20:41     ` Florian Westphal
  2022-08-22 20:51       ` Gabriel Ryan
  0 siblings, 1 reply; 4+ messages in thread
From: Florian Westphal @ 2022-08-22 20:41 UTC
  To: Gabriel Ryan
  Cc: Florian Westphal, Abhishek Shah, coreteam, davem, edumazet,
	kadlec, kuba, netdev, netfilter-devel, pabeni, pablo,
	linux-kernel

Gabriel Ryan <gabe@cs.columbia.edu> wrote:
> Hi Florian,
> 
> I just looked at the lock event trace from our report and it looks
> like two distinct commit mutexes were held when the race was
> triggered. I think the race is probably on the table_handle variable
> on net/netfilter/nf_tables_api.c:1221, and not the table->handle field
> being written to.

See

https://patchwork.ozlabs.org/project/netfilter-devel/patch/20220821085939.571378-1-pablo@netfilter.org/

which makes table_handle per netns.
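
As a rough userspace sketch of the per-netns idea (not the patch
itself): once the counter lives next to the mutex that protects it, two
namespaces no longer touch the same memory. struct demo_netns,
demo_netns_init() and demo_alloc_table_handle() are hypothetical names
invented for this example, and a pthread mutex stands in for the
kernel's commit mutex.

```c
#include <assert.h>
#include <pthread.h>
#include <stdint.h>

/* Hypothetical per-namespace state: the counter sits beside its lock. */
struct demo_netns {
	pthread_mutex_t commit_mutex;
	uint64_t table_handle;
};

static void demo_netns_init(struct demo_netns *ns)
{
	int rc = pthread_mutex_init(&ns->commit_mutex, NULL);

	assert(rc == 0);
	(void)rc; /* silence unused-variable warnings when NDEBUG is set */
	ns->table_handle = 0;
}

/*
 * Allocate the next handle for this namespace. Every writer of
 * ns->table_handle takes ns->commit_mutex, so increments within one
 * namespace are serialized and other namespaces are never involved.
 */
static uint64_t demo_alloc_table_handle(struct demo_netns *ns)
{
	uint64_t handle;

	pthread_mutex_lock(&ns->commit_mutex);
	handle = ++ns->table_handle;
	pthread_mutex_unlock(&ns->commit_mutex);

	return handle;
}
```

With two initialized demo_netns objects, each one hands out its own
sequence starting at 1, independent of the other.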


* Re: data-race in nf_tables_newtable / nf_tables_newtable
  2022-08-22 20:41     ` Florian Westphal
@ 2022-08-22 20:51       ` Gabriel Ryan
  0 siblings, 0 replies; 4+ messages in thread
From: Gabriel Ryan @ 2022-08-22 20:51 UTC
  To: Florian Westphal
  Cc: Abhishek Shah, coreteam, davem, edumazet, kadlec, kuba, netdev,
	netfilter-devel, pabeni, pablo, linux-kernel

Ok, glad everything worked out then, thanks for sharing!

Best,

Gabe

