All of lore.kernel.org
 help / color / mirror / Atom feed
From: Ben Hutchings <ben@decadent.org.uk>
To: linux-kernel@vger.kernel.org, stable@vger.kernel.org
Cc: akpm@linux-foundation.org, "Clark Williams" <williams@redhat.com>,
	"Roland McGrath" <roland@hack.frob.com>,
	"Lai Jiangshan" <laijs@cn.fujitsu.com>,
	"Carlos ODonell" <carlos@redhat.com>,
	"Jakub Jelinek" <jakub@redhat.com>,
	"Paul McKenney" <paulmck@linux.vnet.ibm.com>,
	"Sebastian Andrzej Siewior" <bigeasy@linutronix.de>,
	"Thomas Gleixner" <tglx@linutronix.de>,
	"Davidlohr Bueso" <davidlohr@hp.com>,
	"Dave Jones" <davej@redhat.com>,
	"Peter Zijlstra" <peterz@infradead.org>,
	"Darren Hart" <darren@dvhart.com>,
	"Linus Torvalds" <torvalds@linux-foundation.org>,
	"Michael Kerrisk" <mtk.manpages@gmail.com>,
	"Steven Rostedt" <rostedt@goodmis.org>
Subject: [PATCH 3.2 87/92] futex: Add another early deadlock detection check
Date: Sat, 07 Jun 2014 02:26:29 +0100	[thread overview]
Message-ID: <lsq.1402104389.679439736@decadent.org.uk> (raw)
In-Reply-To: <lsq.1402104388.385007124@decadent.org.uk>

3.2.60-rc1 review patch.  If anyone has any objections, please let me know.

------------------

From: Thomas Gleixner <tglx@linutronix.de>

commit 866293ee54227584ffcb4a42f69c1f365974ba7f upstream.

Dave Jones trinity syscall fuzzer exposed an issue in the deadlock
detection code of rtmutex:
  http://lkml.kernel.org/r/20140429151655.GA14277@redhat.com

That underlying issue has been fixed with a patch to the rtmutex code,
but the futex code must not call into rtmutex in that case because
    - it can detect that issue early
    - it avoids a different and more complex fixup for backing out

If the user space variable got manipulated to 0x80000000 which means
no lock holder, but the waiters bit set and an active pi_state in the
kernel is found we can figure out the recursive locking issue by
looking at the pi_state owner. If that is the current task, then we
can safely return -EDEADLK.

The check should have been added in commit 59fa62451 (futex: Handle
futex_pi OWNER_DIED take over correctly) already, but I did not see
the above issue caused by user space manipulation back then.

Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
Cc: Dave Jones <davej@redhat.com>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Peter Zijlstra <peterz@infradead.org>
Cc: Darren Hart <darren@dvhart.com>
Cc: Davidlohr Bueso <davidlohr@hp.com>
Cc: Steven Rostedt <rostedt@goodmis.org>
Cc: Clark Williams <williams@redhat.com>
Cc: Paul McKenney <paulmck@linux.vnet.ibm.com>
Cc: Lai Jiangshan <laijs@cn.fujitsu.com>
Cc: Roland McGrath <roland@hack.frob.com>
Cc: Carlos ODonell <carlos@redhat.com>
Cc: Jakub Jelinek <jakub@redhat.com>
Cc: Michael Kerrisk <mtk.manpages@gmail.com>
Cc: Sebastian Andrzej Siewior <bigeasy@linutronix.de>
Link: http://lkml.kernel.org/r/20140512201701.097349971@linutronix.de
Signed-off-by: Thomas Gleixner <tglx@linutronix.de>
[bwh: Backported to 3.2: adjust context]
Signed-off-by: Ben Hutchings <ben@decadent.org.uk>
---
 kernel/futex.c | 47 ++++++++++++++++++++++++++++++++++-------------
 1 file changed, 34 insertions(+), 13 deletions(-)

--- a/kernel/futex.c
+++ b/kernel/futex.c
@@ -590,7 +590,8 @@ void exit_pi_state_list(struct task_stru
 
 static int
 lookup_pi_state(u32 uval, struct futex_hash_bucket *hb,
-		union futex_key *key, struct futex_pi_state **ps)
+		union futex_key *key, struct futex_pi_state **ps,
+		struct task_struct *task)
 {
 	struct futex_pi_state *pi_state = NULL;
 	struct futex_q *this, *next;
@@ -634,6 +635,16 @@ lookup_pi_state(u32 uval, struct futex_h
 					return -EINVAL;
 			}
 
+			/*
+			 * Protect against a corrupted uval. If uval
+			 * is 0x80000000 then pid is 0 and the waiter
+			 * bit is set. So the deadlock check in the
+			 * calling code has failed and we did not fall
+			 * into the check above due to !pid.
+			 */
+			if (task && pi_state->owner == task)
+				return -EDEADLK;
+
 			atomic_inc(&pi_state->refcount);
 			*ps = pi_state;
 
@@ -783,7 +794,7 @@ retry:
 	 * We dont have the lock. Look up the PI state (or create it if
 	 * we are the first waiter):
 	 */
-	ret = lookup_pi_state(uval, hb, key, ps);
+	ret = lookup_pi_state(uval, hb, key, ps, task);
 
 	if (unlikely(ret)) {
 		switch (ret) {
@@ -1193,7 +1204,7 @@ void requeue_pi_wake_futex(struct futex_
  *
  * Returns:
  *  0 - failed to acquire the lock atomicly
- *  1 - acquired the lock
+ * >0 - acquired the lock, return value is vpid of the top_waiter
  * <0 - error
  */
 static int futex_proxy_trylock_atomic(u32 __user *pifutex,
@@ -1204,7 +1215,7 @@ static int futex_proxy_trylock_atomic(u3
 {
 	struct futex_q *top_waiter = NULL;
 	u32 curval;
-	int ret;
+	int ret, vpid;
 
 	if (get_futex_value_locked(&curval, pifutex))
 		return -EFAULT;
@@ -1232,11 +1243,13 @@ static int futex_proxy_trylock_atomic(u3
 	 * the contended case or if set_waiters is 1.  The pi_state is returned
 	 * in ps in contended cases.
 	 */
+	vpid = task_pid_vnr(top_waiter->task);
 	ret = futex_lock_pi_atomic(pifutex, hb2, key2, ps, top_waiter->task,
 				   set_waiters);
-	if (ret == 1)
+	if (ret == 1) {
 		requeue_pi_wake_futex(top_waiter, key2, hb2);
-
+		return vpid;
+	}
 	return ret;
 }
 
@@ -1268,7 +1281,6 @@ static int futex_requeue(u32 __user *uad
 	struct futex_hash_bucket *hb1, *hb2;
 	struct plist_head *head1;
 	struct futex_q *this, *next;
-	u32 curval2;
 
 	if (requeue_pi) {
 		/*
@@ -1354,16 +1366,25 @@ retry_private:
 		 * At this point the top_waiter has either taken uaddr2 or is
 		 * waiting on it.  If the former, then the pi_state will not
 		 * exist yet, look it up one more time to ensure we have a
-		 * reference to it.
+		 * reference to it. If the lock was taken, ret contains the
+		 * vpid of the top waiter task.
 		 */
-		if (ret == 1) {
+		if (ret > 0) {
 			WARN_ON(pi_state);
 			drop_count++;
 			task_count++;
-			ret = get_futex_value_locked(&curval2, uaddr2);
-			if (!ret)
-				ret = lookup_pi_state(curval2, hb2, &key2,
-						      &pi_state);
+			/*
+			 * If we acquired the lock, then the user
+			 * space value of uaddr2 should be vpid. It
+			 * cannot be changed by the top waiter as it
+			 * is blocked on hb2 lock if it tries to do
+			 * so. If something fiddled with it behind our
+			 * back the pi state lookup might unearth
+			 * it. So we rather use the known value than
+			 * rereading and handing potential crap to
+			 * lookup_pi_state.
+			 */
+			ret = lookup_pi_state(ret, hb2, &key2, &pi_state, NULL);
 		}
 
 		switch (ret) {


  parent reply	other threads:[~2014-06-07  1:32 UTC|newest]

Thread overview: 98+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-06-07  1:26 [PATCH 3.2 00/92] 3.2.60-rc1 review Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 11/92] net: ipv4: current group_info should be put after using Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 03/92] ACPI / EC: Process rather than discard events in acpi_ec_clear Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 22/92] macvlan: Don't propagate IFF_ALLMULTI changes on down interfaces Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 14/92] rtnetlink: Warn when interface's information won't fit in our packet Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 12/92] filter: prevent nla extensions to peek beyond the end of the message Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 10/92] ipv6: Limit mtu to 65575 bytes Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 25/92] skb: Add inline helper for getting the skb end offset from head Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 13/92] tg3: update rx_jumbo_pending ring param only when jumbo frames are enabled Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 20/92] net: ipv4: ip_forward: fix inverted local_df test Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 09/92] bonding: Remove debug_fs files when module init fails Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 17/92] sctp: reset flowi4_oif parameter on route lookup Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 07/92] l2tp: take PMTU from tunnel UDP socket Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 05/92] netfilter: Can't fail and free after table replacement Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 24/92] ipv4: initialise the itag variable in __mkroute_input Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 30/92] drm/vmwgfx: Make sure user-space can't DMA across buffer object boundaries v2 Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 26/92] net-gro: reset skb->truesize in napi_reuse_skb() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 28/92] rt2x00: fix beaconing on USB Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 06/92] tracepoint: Do not waste memory on mods with no tracepoints Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 08/92] net: core: don't account for udp header size when computing seglen Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 01/92] powerpc: Add vr save/restore functions Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 19/92] tcp_cubic: fix the range of delayed_ack Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 27/92] thinkpad-acpi: fix issuing duplicated key events for brightness up/down Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 23/92] act_mirred: do not drop packets when fails to mirror it Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 21/92] ipv4: fib_semantics: increment fib_info_cnt after fib_info allocation Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 02/92] ACPI / EC: Clear stale EC events on Samsung systems Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 16/92] bridge: Handle IFLA_ADDRESS correctly when creating bridge device Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 29/92] Input: synaptics - add min/max quirk for ThinkPad Edge E431 Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 15/92] rtnetlink: Only supply IFLA_VF_PORTS information when RTEXT_FILTER_VF is set Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 04/92] tgafb: fix mode setting with fbset Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 18/92] Revert "macvlan : fix checksums error when we are in bridge mode" Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 44/92] [media] media-device: fix infoleak in ioctl media_enum_entities() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 60/92] mm/page-writeback.c: fix divide by zero in pos_ratio_polynom Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 48/92] USB: Nokia 305 should be treated as unusual dev Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 63/92] NFSD: Call ->set_acl with a NULL ACL structure if no entries Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 42/92] rtlwifi: rtl8192cu: Fix too long disable of IRQs Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 32/92] Bluetooth: Add support for Lite-on [04ca:3007] Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 59/92] Negative (setpoint-dirty) in bdi_position_ratio() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 71/92] i2c: s3c2410: resume race fix Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 70/92] i2c: designware: Mask all interrupts during i2c controller enable Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 58/92] posix_acl: handle NULL ACL in posix_acl_equiv_mode Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 47/92] drivers/tty/hvc: don't free hvc_console_setup after init Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 65/92] hwmon: (emc1403) fix inverted store_hyst() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 90/92] futex: Validate atomic acquisition in futex_lock_pi_atomic() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 66/92] hwmon: (emc1403) Support full range of known chip revision numbers Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 46/92] usb: storage: shuttle_usbat: fix discs being detected twice Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 31/92] Bluetooth: Fix redundant encryption request for reauthentication Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 78/92] drm/radeon: also try GART for CPU accessed buffers Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 36/92] KVM: async_pf: mm->mm_users can not pin apf->mm Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 69/92] x86, mm, hugetlb: Add missing TLB page invalidation for hugetlb_cow() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 86/92] mm/memory-failure.c: fix memory leak by race between poison and unpoison Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 51/92] HID: usbhid: quirk for Synaptics HD touchscreen Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 55/92] md: avoid possible spinning md thread at shutdown Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 75/92] libceph: fix corruption when using page_count 0 page in rbd Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 49/92] USB: Nokia 5300 should be treated as unusual dev Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 92/92] futex: Make lookup_pi_state more robust Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 62/92] trace: module: Maintain a valid user count Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 72/92] x86-64, modify_ldt: Make support for 16-bit segments a runtime option Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 52/92] HID: usbhid: quirk for Synaptics Quad HD touchscreen Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 83/92] nfsd4: warn on finding lockowner without stateid's Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 57/92] NFSd: call rpc_destroy_wait_queue() from free_client() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 77/92] perf: Prevent false warning in perf_swevent_add Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 53/92] HID: add NO_INIT_REPORTS quirk for Synaptics Touch Pad V 103S Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 73/92] PCI: shpchp: Check bridge's secondary (not primary) bus speed Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 54/92] Input: elantech - fix touchpad initialization on Gigabyte U2442 Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 67/92] [media] V4L2: ov7670: fix a wrong index, potentially Oopsing the kernel from user-space Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 80/92] can: peak_pci: Fix the way channels are linked together Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 37/92] ftrace/module: Hardcode ftrace_module_init() call into load_module() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 74/92] libceph: only call kernel_sendpage() via helper Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 33/92] crypto: caam - add allocation failure handling in SPRINTFCAT macro Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 45/92] drm/nouveau/acpi: allow non-optimus setups to load vbios from acpi Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 85/92] hwpoison, hugetlb: lock_page/unlock_page does not match for handling a free hugepage Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 35/92] kvm: remove .done from struct kvm_async_pf Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 34/92] kvm: free resources after canceling async_pf Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 61/92] sched: Use CPUPRI_NR_PRIORITIES instead of MAX_RT_PRIO in cpupri check Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 50/92] HID: usbhid: quirk for Synaptics Large Touchccreen Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 89/92] futex-prevent-requeue-pi-on-same-futex.patch futex: Forbid uaddr == uaddr2 in futex_requeue(..., requeue_pi=1) Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 68/92] [media] V4L2: fix VIDIOC_CREATE_BUFS in 64- / 32-bit compatibility mode Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 40/92] hrtimer: Prevent remote enqueue of leftmost timers Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 79/92] drm/radeon: handle non-VGA class pci devices with ATRM Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 82/92] nfsd4: remove lockowner when removing lock stateid Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 64/92] hrtimer: Set expiry time before switch_hrtimer_base() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 76/92] perf: Limit perf_event_attr::sample_period to 63 bits Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 41/92] timer: Prevent overflow in apply_slack Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 39/92] hrtimer: Prevent all reprogramming if hang detected Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 81/92] can: peak_pci: prevent use after free at netdev removal Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 91/92] futex: Always cleanup owner tid in unlock_pi Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 38/92] [SCSI] mpt2sas: Don't disable device twice at suspend Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 88/92] futex: Prevent attaching to kernel threads Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 84/92] dma: mv_xor: Flush descriptors before activating a channel Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 56/92] NFSd: Move default initialisers from create_client() to alloc_client() Ben Hutchings
2014-06-07  1:26 ` [PATCH 3.2 43/92] rtl8192cu: Fix unbalanced irq enable in error path of rtl92cu_hw_init() Ben Hutchings
2014-06-07  1:26 ` Ben Hutchings [this message]
2014-06-07  2:23 ` [PATCH 3.2 00/92] 3.2.60-rc1 review Ben Hutchings
2014-06-07 16:33 ` Guenter Roeck
2014-06-07 17:00   ` Ben Hutchings
2014-06-09 23:25     ` Satoru Takeuchi
2014-06-09 23:48       ` Ben Hutchings

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=lsq.1402104389.679439736@decadent.org.uk \
    --to=ben@decadent.org.uk \
    --cc=akpm@linux-foundation.org \
    --cc=bigeasy@linutronix.de \
    --cc=carlos@redhat.com \
    --cc=darren@dvhart.com \
    --cc=davej@redhat.com \
    --cc=davidlohr@hp.com \
    --cc=jakub@redhat.com \
    --cc=laijs@cn.fujitsu.com \
    --cc=linux-kernel@vger.kernel.org \
    --cc=mtk.manpages@gmail.com \
    --cc=paulmck@linux.vnet.ibm.com \
    --cc=peterz@infradead.org \
    --cc=roland@hack.frob.com \
    --cc=rostedt@goodmis.org \
    --cc=stable@vger.kernel.org \
    --cc=tglx@linutronix.de \
    --cc=torvalds@linux-foundation.org \
    --cc=williams@redhat.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.