[BUG] RTNL and flush_scheduled_work deadlocks

From: Stephen Hemminger @ 2007-02-14 21:27 UTC
  To: Francois Romieu
  Cc: netdev, Ben Greear, Kyle Lucke, Raghavendra Koushik, Al Viro

Ben found this, but the problem seems pretty widespread.

The places listed below are subject to deadlock between
flush_scheduled_work and the RTNL mutex. A work queue routine (like the
bridge's port_carrier_check) can block forever waiting for RTNL, while the
driver routine that called flush_scheduled_work with RTNL held waits for
the work queue to clear.
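
To make the cycle concrete, here is a rough userspace model using pthreads.
It is not kernel code: the mutex named rtnl, work_fn and
model_close_flush_under_rtnl are made-up stand-ins, pthread_join plays the
part of flush_scheduled_work, and the work item uses trylock so the demo
terminates instead of hanging the way the real thing would.

```c
#include <assert.h>
#include <errno.h>
#include <pthread.h>

/* Userspace stand-in for the kernel's RTNL mutex (hypothetical model). */
static pthread_mutex_t rtnl = PTHREAD_MUTEX_INITIALIZER;

/* Set by the work item when it finds RTNL already held. */
static int work_would_block;

/* Models a work queue routine that needs RTNL, e.g. a carrier check. */
static void *work_fn(void *arg)
{
    int rc;

    (void)arg;
    rc = pthread_mutex_trylock(&rtnl);
    if (rc == EBUSY) {
        /* The kernel routine would sleep here waiting for RTNL. */
        work_would_block = 1;
        return NULL;
    }
    assert(rc == 0);
    pthread_mutex_unlock(&rtnl);
    return NULL;
}

/*
 * Models a close routine: RTNL is already held, then it waits for the
 * queued work to finish (pthread_join stands in for the flush).
 * Returns 1 if the work item needed the lock the flusher was holding.
 */
int model_close_flush_under_rtnl(void)
{
    pthread_t worker;
    int rc;

    work_would_block = 0;
    pthread_mutex_lock(&rtnl);                  /* caller holds RTNL */
    rc = pthread_create(&worker, NULL, work_fn, NULL);
    assert(rc == 0);
    rc = pthread_join(worker, NULL);            /* "flush" with RTNL held */
    assert(rc == 0);
    pthread_mutex_unlock(&rtnl);
    return work_would_block;
}
```

With a blocking lock in work_fn instead of trylock, the join would never
return, which is the deadlock described above.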

Several other places have comments like: "can't call flush_scheduled_work
here or it will deadlock". Most of the problem places are in device close
routines. My recommendation would be to add a netif_running check to
whatever work routine is used, and move the flush_scheduled_work call to
the remove routine.
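
A sketch of that arrangement, again as a userspace pthread model rather
than driver code. Everything here is an assumption for illustration:
struct model_dev, its running flag (standing in for netif_running),
model_close, model_remove and guarded_work_fn are hypothetical names, and
pthread_join stands in for flush_scheduled_work.

```c
#include <assert.h>
#include <pthread.h>

/* Userspace stand-in for the RTNL mutex (hypothetical model). */
static pthread_mutex_t model_rtnl = PTHREAD_MUTEX_INITIALIZER;

struct model_dev {
    int running;        /* stand-in for netif_running(dev) */
    int work_ran;       /* counts work items that did real work */
    pthread_t worker;   /* stand-in for the queued work item */
};

/* Work routine: takes the lock, then bails out if the device is down. */
static void *guarded_work_fn(void *arg)
{
    struct model_dev *dev = arg;

    pthread_mutex_lock(&model_rtnl);
    if (dev->running)
        dev->work_ran++;
    pthread_mutex_unlock(&model_rtnl);
    return NULL;
}

/* Close path: called with the lock held, so it must not flush. */
static void model_close(struct model_dev *dev)
{
    dev->running = 0;
}

/* Remove path: called without the lock, so waiting for work is safe. */
static void model_remove(struct model_dev *dev)
{
    int rc = pthread_join(dev->worker, NULL);

    assert(rc == 0);
}

/* Returns how many work items did real work after close (expected: 0). */
int model_close_then_remove(void)
{
    struct model_dev dev = { .running = 1, .work_ran = 0 };
    int rc;

    pthread_mutex_lock(&model_rtnl);            /* caller holds the lock */
    rc = pthread_create(&dev.worker, NULL, guarded_work_fn, &dev);
    assert(rc == 0);
    model_close(&dev);                          /* no flush here */
    pthread_mutex_unlock(&model_rtnl);
    model_remove(&dev);                         /* flush, lock not held */
    return dev.work_ran;
}
```

The work item still blocks while the close path holds the lock, but nobody
is waiting on it at that point, so it proceeds once the lock is dropped and
then sees the device is down.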

8139too.c: rtl8139_close --> rtl8139_stop_thread
r8169.c:   rtl8169_down
cassini.c: cas_change_mtu
iseries_veth.c: veth_stop_connection
s2io.c: s2io_close
sis190.c: sis190_down
