All of lore.kernel.org
 help / color / mirror / Atom feed
* [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove()
@ 2022-04-20  4:37 Alistair Popple
  2022-04-20 22:11 ` Andrew Morton
  0 siblings, 1 reply; 6+ messages in thread
From: Alistair Popple @ 2022-04-20  4:37 UTC (permalink / raw)
  To: akpm
  Cc: linux-mm, linux-kernel, christian.koenig, jhubbard, rcampbell,
	Alistair Popple, Jason Gunthorpe

In some cases it is possible for mmu_interval_notifier_remove() to race
with mn_tree_inv_end() allowing it to return while the notifier data
structure is still in use. Consider the following sequence:

CPU0 - mn_tree_inv_end()            CPU1 - mmu_interval_notifier_remove()
----------------------------------- ------------------------------------
                                    spin_lock(subscriptions->lock);
                                    seq = subscriptions->invalidate_seq;
spin_lock(subscriptions->lock);     spin_unlock(subscriptions->lock);
subscriptions->invalidate_seq++;
                                    wait_event(invalidate_seq != seq);
                                    return;
interval_tree_remove(interval_sub); kfree(interval_sub);
spin_unlock(subscriptions->lock);
wake_up_all();

As the wait_event() condition is true it will return immediately. This
can lead to use-after-free type errors if the caller frees the data
structure containing the interval notifier subscription while it is
still on a deferred list. Fix this by taking the appropriate lock when
reading invalidate_seq to ensure proper synchronisation.

Signed-off-by: Alistair Popple <apopple@nvidia.com>
Signed-off-by: Jason Gunthorpe <jgg@nvidia.com>
Fixes: 99cb252f5e68 ("mm/mmu_notifier: add an interval tree notifier")
---
 mm/mmu_notifier.c | 14 +++++++++++++-
 1 file changed, 13 insertions(+), 1 deletion(-)

diff --git a/mm/mmu_notifier.c b/mm/mmu_notifier.c
index 3f3bbcd298c6..e0275b9f6b81 100644
--- a/mm/mmu_notifier.c
+++ b/mm/mmu_notifier.c
@@ -1036,6 +1036,18 @@ int mmu_interval_notifier_insert_locked(
 }
 EXPORT_SYMBOL_GPL(mmu_interval_notifier_insert_locked);
 
+static bool
+mmu_interval_seq_released(struct mmu_notifier_subscriptions *subscriptions,
+			  unsigned long seq)
+{
+	bool ret;
+
+	spin_lock(&subscriptions->lock);
+	ret = subscriptions->invalidate_seq != seq;
+	spin_unlock(&subscriptions->lock);
+	return ret;
+}
+
 /**
  * mmu_interval_notifier_remove - Remove a interval notifier
  * @interval_sub: Interval subscription to unregister
@@ -1086,7 +1098,7 @@ void mmu_interval_notifier_remove(struct mmu_interval_notifier *interval_sub)
 	lock_map_release(&__mmu_notifier_invalidate_range_start_map);
 	if (seq)
 		wait_event(subscriptions->wq,
-			   READ_ONCE(subscriptions->invalidate_seq) != seq);
+			   mmu_interval_seq_released(subscriptions, seq));
 
 	/* pairs with mmgrab in mmu_interval_notifier_insert() */
 	mmdrop(mm);
-- 
2.34.1


^ permalink raw reply related	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove()
  2022-04-20  4:37 [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove() Alistair Popple
@ 2022-04-20 22:11 ` Andrew Morton
  2022-04-20 23:21   ` Alistair Popple
  2022-04-21 13:14   ` Jason Gunthorpe
  0 siblings, 2 replies; 6+ messages in thread
From: Andrew Morton @ 2022-04-20 22:11 UTC (permalink / raw)
  To: Alistair Popple
  Cc: linux-mm, linux-kernel, christian.koenig, jhubbard, rcampbell,
	Jason Gunthorpe

On Wed, 20 Apr 2022 14:37:34 +1000 Alistair Popple <apopple@nvidia.com> wrote:

> In some cases it is possible for mmu_interval_notifier_remove() to race
> with mn_tree_inv_end() allowing it to return while the notifier data
> structure is still in use. Consider the following sequence:
> 
> CPU0 - mn_tree_inv_end()            CPU1 - mmu_interval_notifier_remove()
> ----------------------------------- ------------------------------------
>                                     spin_lock(subscriptions->lock);
>                                     seq = subscriptions->invalidate_seq;
> spin_lock(subscriptions->lock);     spin_unlock(subscriptions->lock);
> subscriptions->invalidate_seq++;
>                                     wait_event(invalidate_seq != seq);
>                                     return;
> interval_tree_remove(interval_sub); kfree(interval_sub);
> spin_unlock(subscriptions->lock);
> wake_up_all();
> 
> As the wait_event() condition is true it will return immediately. This
> can lead to use-after-free type errors if the caller frees the data
> structure containing the interval notifier subscription while it is
> still on a deferred list. Fix this by taking the appropriate lock when
> reading invalidate_seq to ensure proper synchronisation.
> 
> ...
>
> Fixes: 99cb252f5e68 ("mm/mmu_notifier: add an interval tree notifier")

Do you think fix this should be backported into older kernels?

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove()
  2022-04-20 22:11 ` Andrew Morton
@ 2022-04-20 23:21   ` Alistair Popple
  2022-04-20 23:35     ` Andrew Morton
  2022-04-21 13:14   ` Jason Gunthorpe
  1 sibling, 1 reply; 6+ messages in thread
From: Alistair Popple @ 2022-04-20 23:21 UTC (permalink / raw)
  To: Andrew Morton
  Cc: linux-mm, linux-kernel, christian.koenig, jhubbard, rcampbell,
	Jason Gunthorpe

[-- Attachment #1: Type: text/plain, Size: 1606 bytes --]

Andrew Morton <akpm@linux-foundation.org> writes:

> On Wed, 20 Apr 2022 14:37:34 +1000 Alistair Popple <apopple@nvidia.com> wrote:
>
>> In some cases it is possible for mmu_interval_notifier_remove() to race
>> with mn_tree_inv_end() allowing it to return while the notifier data
>> structure is still in use. Consider the following sequence:
>>
>> CPU0 - mn_tree_inv_end()            CPU1 - mmu_interval_notifier_remove()
>> ----------------------------------- ------------------------------------
>>                                     spin_lock(subscriptions->lock);
>>                                     seq = subscriptions->invalidate_seq;
>> spin_lock(subscriptions->lock);     spin_unlock(subscriptions->lock);
>> subscriptions->invalidate_seq++;
>>                                     wait_event(invalidate_seq != seq);
>>                                     return;
>> interval_tree_remove(interval_sub); kfree(interval_sub);
>> spin_unlock(subscriptions->lock);
>> wake_up_all();
>>
>> As the wait_event() condition is true it will return immediately. This
>> can lead to use-after-free type errors if the caller frees the data
>> structure containing the interval notifier subscription while it is
>> still on a deferred list. Fix this by taking the appropriate lock when
>> reading invalidate_seq to ensure proper synchronisation.
>>
>> ...
>>
>> Fixes: 99cb252f5e68 ("mm/mmu_notifier: add an interval tree notifier")
>
> Do you think fix this should be backported into older kernels?

Yes, I forgot to cc stable sorry. Do you want me to resend with
'Cc: stable@vger.kernel.org'?

- Alistair

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove()
  2022-04-20 23:21   ` Alistair Popple
@ 2022-04-20 23:35     ` Andrew Morton
  2022-04-21  7:06       ` Alistair Popple
  0 siblings, 1 reply; 6+ messages in thread
From: Andrew Morton @ 2022-04-20 23:35 UTC (permalink / raw)
  To: Alistair Popple
  Cc: linux-mm, linux-kernel, christian.koenig, jhubbard, rcampbell,
	Jason Gunthorpe

On Thu, 21 Apr 2022 09:21:06 +1000 Alistair Popple <apopple@nvidia.com> wrote:

> >> As the wait_event() condition is true it will return immediately. This
> >> can lead to use-after-free type errors if the caller frees the data
> >> structure containing the interval notifier subscription while it is
> >> still on a deferred list. Fix this by taking the appropriate lock when
> >> reading invalidate_seq to ensure proper synchronisation.
> >>
> >> ...
> >>
> >> Fixes: 99cb252f5e68 ("mm/mmu_notifier: add an interval tree notifier")
> >
> > Do you think fix this should be backported into older kernels?
> 
> Yes, I forgot to cc stable sorry.

So we have actually seen these use-after-free errors?

Some description of the end-user visible impact is always helpful when
deciding which trees need a patch.

> Do you want me to resend with
> 'Cc: stable@vger.kernel.org'?

Thanks, I added that.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove()
  2022-04-20 23:35     ` Andrew Morton
@ 2022-04-21  7:06       ` Alistair Popple
  0 siblings, 0 replies; 6+ messages in thread
From: Alistair Popple @ 2022-04-21  7:06 UTC (permalink / raw)
  To: Andrew Morton
  Cc: linux-mm, linux-kernel, christian.koenig, jhubbard, rcampbell,
	Jason Gunthorpe

[-- Attachment #1: Type: text/plain, Size: 1246 bytes --]

Andrew Morton <akpm@linux-foundation.org> writes:

> On Thu, 21 Apr 2022 09:21:06 +1000 Alistair Popple <apopple@nvidia.com> wrote:
>
>> >> As the wait_event() condition is true it will return immediately. This
>> >> can lead to use-after-free type errors if the caller frees the data
>> >> structure containing the interval notifier subscription while it is
>> >> still on a deferred list. Fix this by taking the appropriate lock when
>> >> reading invalidate_seq to ensure proper synchronisation.
>> >>
>> >> ...
>> >>
>> >> Fixes: 99cb252f5e68 ("mm/mmu_notifier: add an interval tree notifier")
>> >
>> > Do you think fix this should be backported into older kernels?
>>
>> Yes, I forgot to cc stable sorry.
>
> So we have actually seen these use-after-free errors?

I observed them whilst running stress testing during some development. You do
have to be pretty unlucky, but it lead to the usual problems of use-after-free
(memory corruption, kernel crash, difficult to diagnose WARN_ON, etc) so I think
it's worth backporting.

> Some description of the end-user visible impact is always helpful when
> deciding which trees need a patch.
>
>> Do you want me to resend with
>> 'Cc: stable@vger.kernel.org'?
>
> Thanks, I added that.

Thanks.

^ permalink raw reply	[flat|nested] 6+ messages in thread

* Re: [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove()
  2022-04-20 22:11 ` Andrew Morton
  2022-04-20 23:21   ` Alistair Popple
@ 2022-04-21 13:14   ` Jason Gunthorpe
  1 sibling, 0 replies; 6+ messages in thread
From: Jason Gunthorpe @ 2022-04-21 13:14 UTC (permalink / raw)
  To: Andrew Morton
  Cc: Alistair Popple, linux-mm, linux-kernel, christian.koenig,
	jhubbard, rcampbell

On Wed, Apr 20, 2022 at 03:11:42PM -0700, Andrew Morton wrote:
> On Wed, 20 Apr 2022 14:37:34 +1000 Alistair Popple <apopple@nvidia.com> wrote:
> 
> > In some cases it is possible for mmu_interval_notifier_remove() to race
> > with mn_tree_inv_end() allowing it to return while the notifier data
> > structure is still in use. Consider the following sequence:
> > 
> > CPU0 - mn_tree_inv_end()            CPU1 - mmu_interval_notifier_remove()
> >                                     spin_lock(subscriptions->lock);
> >                                     seq = subscriptions->invalidate_seq;
> > spin_lock(subscriptions->lock);     spin_unlock(subscriptions->lock);
> > subscriptions->invalidate_seq++;
> >                                     wait_event(invalidate_seq != seq);
> >                                     return;
> > interval_tree_remove(interval_sub); kfree(interval_sub);
> > spin_unlock(subscriptions->lock);
> > wake_up_all();
> > 
> > As the wait_event() condition is true it will return immediately. This
> > can lead to use-after-free type errors if the caller frees the data
> > structure containing the interval notifier subscription while it is
> > still on a deferred list. Fix this by taking the appropriate lock when
> > reading invalidate_seq to ensure proper synchronisation.
> > 
> > ...
> >
> > Fixes: 99cb252f5e68 ("mm/mmu_notifier: add an interval tree notifier")
> 
> Do you think fix this should be backported into older kernels?

I think it should be tagged stable, yes

Jason

^ permalink raw reply	[flat|nested] 6+ messages in thread

end of thread, other threads:[~2022-04-21 13:15 UTC | newest]

Thread overview: 6+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-04-20  4:37 [PATCH v2] mm/mmu_notifier.c: Fix race in mmu_interval_notifier_remove() Alistair Popple
2022-04-20 22:11 ` Andrew Morton
2022-04-20 23:21   ` Alistair Popple
2022-04-20 23:35     ` Andrew Morton
2022-04-21  7:06       ` Alistair Popple
2022-04-21 13:14   ` Jason Gunthorpe

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.