From: Vlastimil Babka <vbabka@suse.cz>
To: Jann Horn <jannh@google.com>
Cc: Linux-MM <linux-mm@kvack.org>,
    kernel list <linux-kernel@vger.kernel.org>,
    Christoph Lameter <cl@linux.com>,
    David Rientjes <rientjes@google.com>,
    Pekka Enberg <penberg@kernel.org>,
    Joonsoo Kim <iamjoonsoo.kim@lge.com>,
    Sebastian Andrzej Siewior <bigeasy@linutronix.de>,
    Thomas Gleixner <tglx@linutronix.de>,
    Mel Gorman <mgorman@techsingularity.net>,
    Jesper Dangaard Brouer <brouer@redhat.com>,
    Peter Zijlstra <peterz@infradead.org>
Subject: Re: [RFC 25/26] mm, slub: use migrate_disable() in put_cpu_partial()
Date: Wed, 9 Jun 2021 10:41:43 +0200
Message-ID: <2a5cddf1-100d-9d24-fb98-8a5bd439a804@suse.cz>
In-Reply-To: <CAG48ez1mvUuXwg0YPH5ANzhQLpbphqk-ZS+jbRz+H66fvm4FcA@mail.gmail.com>

On 5/25/21 5:33 PM, Jann Horn wrote:
> On Tue, May 25, 2021 at 1:40 AM Vlastimil Babka <vbabka@suse.cz> wrote:
>> In put_cpu_partial, we need a stable cpu, but being preempted is not an issue.
>> So, disable migration instead of preemption.
>
> I wouldn't say "not an issue", more like "you're not making it worse".
>
> From what I can tell, the following race can already theoretically happen:
>
> task A: put_cpu_partial() calls preempt_disable()
> task A: oldpage = this_cpu_read(s->cpu_slab->partial)
> interrupt: kfree() reaches unfreeze_partials() and discards the page
> task B (on another CPU): reallocates page as page cache
> task A: reads page->pages and page->pobjects, which are actually
>   halves of the pointer page->lru.prev
> task B (on another CPU): frees page
> interrupt: allocates page as SLUB page and places it on the percpu partial list
> task A: this_cpu_cmpxchg() succeeds

Oops, nice find. Thanks.

> which would cause page->pages and page->pobjects to end up containing
> halves of pointers that would then influence when put_cpu_partial()
> happens and show up in root-only sysfs files. Maybe that's acceptable,
> I don't know. But there should probably at least be a comment for now
> to point out that we're reading union fields of a page that might be
> in a completely different state.
>
> (Someone should probably fix that code sometime and get rid of
> page->pobjects entirely, given how inaccurate it is...)

I'll try to address it separately later. Probably just target a number of
pages, instead of objects, on the list and store the number as part of
struct kmem_cache_cpu, not struct page. The inaccuracy leading to
potentially long lists is a good reason enough, the race scenario above is
another one...
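
To make the "halves of the pointer" step in the race above concrete, here is a minimal userspace sketch. It is not kernel code and not part of the series. The types `list_head_model` and `page_model` and the helper names are hypothetical, and the layout is only a simplified model of a 64-bit struct page, in which two 32-bit counters share storage with a list pointer. It shows that reading the counters while the page is in its "other" state returns the two halves of that pointer.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for struct list_head. */
struct list_head_model {
	struct list_head_model *next, *prev;
};

/*
 * Hypothetical, simplified model of the relevant union in a 64-bit
 * struct page: the same storage is either a list_head (page in some
 * other use) or a next pointer plus two counters (page on the SLUB
 * percpu partial list). The real structure has many more fields.
 */
struct page_model {
	union {
		struct list_head_model lru;
		struct {
			struct page_model *next;
			int32_t pages;
			int32_t pobjects;
		} partial;
	};
};

/* Returns 1 if the counters start at the same offset as lru.prev. */
int counters_alias_prev(void)
{
	return offsetof(struct page_model, partial.pages) ==
	       offsetof(struct page_model, lru.prev);
}

/*
 * Read both counters the way a stale reader would, then glue the two
 * 32-bit values back together in memory order. If the page currently
 * holds a list_head, the result is the bit pattern of lru.prev.
 */
uint64_t stale_counters_as_u64(const struct page_model *page)
{
	int32_t halves[2] = { page->partial.pages, page->partial.pobjects };
	uint64_t bits;

	assert(sizeof(halves) == sizeof(bits));
	memcpy(&bits, halves, sizeof(bits));
	return bits;
}
```

Calling `stale_counters_as_u64()` on a `page_model` whose `lru.prev` was just set returns that pointer's value on an LP64 system, which is the kind of garbage that would end up in the counters if the final cmpxchg then succeeds. Keeping the count in the per-CPU structure instead, as suggested in the reply, would avoid reading these union fields from a page that may be in a different state.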