From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.2 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,USER_AGENT_SANE_1 autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id A0D44C432C1 for ; Tue, 24 Sep 2019 20:07:48 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 77AFB214DA for ; Tue, 24 Sep 2019 20:07:48 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728477AbfIXUHs (ORCPT ); Tue, 24 Sep 2019 16:07:48 -0400 Received: from youngberry.canonical.com ([91.189.89.112]:38737 "EHLO youngberry.canonical.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728405AbfIXUHr (ORCPT ); Tue, 24 Sep 2019 16:07:47 -0400 Received: from lmontsouris-656-1-55-152.w80-15.abo.wanadoo.fr ([80.15.152.152] helo=wittgenstein) by youngberry.canonical.com with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.86_2) (envelope-from ) id 1iCr65-0008FZ-2D; Tue, 24 Sep 2019 20:07:42 +0000 Date: Tue, 24 Sep 2019 22:07:36 +0200 From: Christian Brauner To: "Michael Kerrisk (man-pages)" Cc: Florian Weimer , Oleg Nesterov , Jann Horn , "Eric W. Biederman" , Daniel Colascione , Joel Fernandes , linux-man , Linux API , lkml Subject: Re: For review: pidfd_send_signal(2) manual page Message-ID: <20190924200735.2dvqhan7ynnmfc7s@wittgenstein> References: <87pnjr9rth.fsf@mid.deneb.enyo.de> <20190923142325.jowzbnwjw7g7si7j@wittgenstein> <90dd38d5-34b3-b72f-8e5a-b51f944f22fb@gmail.com> <20190924195701.7pw2olbviieqsg5q@wittgenstein> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20190924195701.7pw2olbviieqsg5q@wittgenstein> User-Agent: NeoMutt/20180716 Sender: linux-man-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-man@vger.kernel.org On Tue, Sep 24, 2019 at 09:57:04PM +0200, Christian Brauner wrote: > On Tue, Sep 24, 2019 at 09:44:49PM +0200, Michael Kerrisk (man-pages) wrote: > > Hello Christian, > > > > On 9/23/19 4:23 PM, Christian Brauner wrote: > > > On Mon, Sep 23, 2019 at 01:26:34PM +0200, Florian Weimer wrote: > > >> * Michael Kerrisk: > > >> > > >>> SYNOPSIS > > >>> int pidfd_send_signal(int pidfd, int sig, siginfo_t info, > > >>> unsigned int flags); > > >> > > >> This probably should reference a header for siginfo_t. > > > > > > Agreed. > > > > > >> > > >>> ESRCH The target process does not exist. > > >> > > >> If the descriptor is valid, does this mean the process has been waited > > >> for? Maybe this can be made more explicit. > > > > > > If by valid you mean "refers to a process/thread-group leader" aka is a > > > pidfd then yes: Getting ESRCH means that the process has exited and has > > > already been waited upon. > > > If it had only exited but not waited upon aka is a zombie, then sending > > > a signal will just work because that's currently how sending signals to > > > zombies works, i.e. if you only send a signal and don't do any > > > additional checks you won't notice a difference between a process being > > > alive and a process being a zombie. The userspace visible behavior in > > > terms of signaling them is identical. > > > > (Thanks for the clarification. I added the text "(i.e., it has > > terminated and been waited on)" to the ESRCH error.) > > > > >>> The pidfd_send_signal() system call allows the avoidance of race > > >>> conditions that occur when using traditional interfaces (such as > > >>> kill(2)) to signal a process. The problem is that the traditional > > >>> interfaces specify the target process via a process ID (PID), with > > >>> the result that the sender may accidentally send a signal to the > > >>> wrong process if the originally intended target process has termi‐ > > >>> nated and its PID has been recycled for another process. By con‐ > > >>> trast, a PID file descriptor is a stable reference to a specific > > >>> process; if that process terminates, then the file descriptor > > >>> ceases to be valid and the caller of pidfd_send_signal() is > > >>> informed of this fact via an ESRCH error. > > >> > > >> It would be nice to explain somewhere how you can avoid the race using > > >> a PID descriptor. Is there anything else besides CLONE_PIDFD? > > > > > > If you're the parent of the process you can do this without CLONE_PIDFD: > > > pid = fork(); > > > pidfd = pidfd_open(); > > > ret = pidfd_send_signal(pidfd, 0, NULL, 0); > > > if (ret < 0 && errno == ESRCH) > > > /* pidfd refers to another, recycled process */ > > > > Although there is still the race between the fork() and the > > pidfd_open(), right? > > Actually no and my code is even too complex. > If you are the parent, and this is really a sequence that obeys the > ordering pidfd_open() before waiting: > > pid = fork(); > if (pid == 0) > exit(EXIT_SUCCESS); > pidfd = pidfd_open(pid, 0); > waitid(pid, ...); > > Then you are guaranteed that pidfd will refer to pid. No recycling can > happen since the process has not been waited upon yet (That is, > excluding special cases such as where you have a mainloop where a > callback reacts to a SIGCHLD event and waits on the child behind your > back and your next callback in the mainloop calls pidfd_open() while the > pid has been recycled etc.). If we wanted to be super nitpicky one could also get in that situation where you do: signal(SIGCHLD,SIG_IGN); // or struct sigaction sa; sa.sa_handler = SIG_IGN; sigemptyset(&sa.sa_mask); sa.sa_flags = 0; sigaction(SIGCHLD, &sa, 0) pid = fork(); if (pid == 0) exit(EXIT_SUCCESS); pidfd = pidfd_open(); because then the process gets autoreaped and can be recycled. But again, that's just bad form and in that scenario one should again use clone(CLONE_PIDFD) instead of fork(). Christian