* 3.12.y: srcu warning during free_dev() with dm-multipath
@ 2015-06-17 16:26 Sebastian Parschauer
2015-06-22 8:55 ` Sebastian Parschauer
0 siblings, 1 reply; 5+ messages in thread
From: Sebastian Parschauer @ 2015-06-17 16:26 UTC (permalink / raw)
To: Alasdair Kergon, Mike Snitzer; +Cc: dm-devel, Sebastian Parschauer
Hi dm developers,
we hit the srcu warning in cleanup_srcu_struct() when removing
dm-multipath devices. We run v3.12.40 but there is no change compared to
v3.12.44 in drivers/md/dm.c or kernel/srcu.c.
Would the following upstream commit fix this?
63a4f06 dm: fix add_disk() NULL pointer due to race with free_dev()
Thanks,
Sebastian
===============
Details:
> WARNING: CPU: 36 PID: 3772 at kernel/srcu.c:285 cleanup_srcu_struct+0x8d/0x90()
> CPU: 36 PID: 3772 Comm: multipathd Tainted: G O 3.12.40-2 #1
> 0000000000000009 ffff882803213bc8 ffffffff817138b0 0000000000000007
> 0000000000000000 ffff882803213c08 ffffffff81045fb7 ffff8827035a0870
> 0000000000000001 ffffffff81cbf760 ffff8827fea77808 ffff8827fea77800
> Call Trace:
> [<ffffffff817138b0>] dump_stack+0x46/0x58
> [<ffffffff81045fb7>] warn_slowpath_common+0x87/0xb0
> [<ffffffff81045ff5>] warn_slowpath_null+0x15/0x20
> [<ffffffff8106bd5d>] cleanup_srcu_struct+0x8d/0x90
> [<ffffffff81594986>] __dm_destroy+0x176/0x240
> [<ffffffff8159550e>] dm_destroy+0xe/0x10
> [<ffffffff8159ae9f>] dev_remove+0x9f/0x100
> [<ffffffff8159ae00>] ? __hash_remove+0xd0/0xd0
> [<ffffffff8159b2f6>] ctl_ioctl+0x246/0x4d0
> [<ffffffff81032c19>] ? physflat_send_IPI_mask+0x9/0x10
> [<ffffffff8159b58e>] dm_ctl_ioctl+0xe/0x20
> [<ffffffff8116d587>] do_vfs_ioctl+0x87/0x510
> [<ffffffff8105478a>] ? recalc_sigpending+0x1a/0x60
> [<ffffffff81055362>] ? __set_task_blocked+0x32/0x80
> [<ffffffff8116daa1>] SyS_ioctl+0x91/0xb0
> [<ffffffff81721a02>] system_call_fastpath+0x16/0x1b
> ---[ end trace 8c8c7b2a24a51f9c ]---
> (gdb) list *__dm_destroy+0x172
> 0xffffffff8159fc72 is in __dm_destroy (drivers/md/dm.c:2140).
> 2135 mempool_destroy(md->io_pool);
> 2136 if (md->bs)
> 2137 bioset_free(md->bs);
> 2138 blk_integrity_unregister(md->disk);
> 2139 del_gendisk(md->disk);
> 2140 cleanup_srcu_struct(&md->io_barrier);
> 2141 free_minor(minor);
> 2142
> 2143 spin_lock(&_minor_lock);
> 2144 md->disk->private_data = NULL;
This is located in the function free_dev() in fact.
> (gdb) list *cleanup_srcu_struct+0x89
> 0xffffffff8106f2c9 is in cleanup_srcu_struct (kernel/srcu.c:285).
> 280 * Must invoke this after you are finished using a given srcu_struct that
> 281 * was initialized via init_srcu_struct(), else you leak memory.
> 282 */
> 283 void cleanup_srcu_struct(struct srcu_struct *sp)
> 284 {
> 285 if (WARN_ON(srcu_readers_active(sp)))
> 286 return; /* Leakage unless caller handles error. */
> 287 free_percpu(sp->per_cpu_ref);
> 288 sp->per_cpu_ref = NULL;
> 289 }
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 3.12.y: srcu warning during free_dev() with dm-multipath
2015-06-17 16:26 3.12.y: srcu warning during free_dev() with dm-multipath Sebastian Parschauer
@ 2015-06-22 8:55 ` Sebastian Parschauer
2015-06-22 10:25 ` Zdenek Kabelac
2015-06-22 14:21 ` Mike Snitzer
0 siblings, 2 replies; 5+ messages in thread
From: Sebastian Parschauer @ 2015-06-22 8:55 UTC (permalink / raw)
To: Alasdair Kergon, Mike Snitzer; +Cc: dm-devel, Sebastian Parschauer
Really, no response? Again?
Why doesn't that surprise me if sending a bug report to Red Hat not
using their kernel,... Thanks a lot! So this is how discrimination feels
like,...
On 17.06.2015 18:26, Sebastian Parschauer wrote:
> Hi dm developers,
>
> we hit the srcu warning in cleanup_srcu_struct() when removing
> dm-multipath devices. We run v3.12.40 but there is no change compared to
> v3.12.44 in drivers/md/dm.c or kernel/srcu.c.
>
> Would the following upstream commit fix this?
>
> 63a4f06 dm: fix add_disk() NULL pointer due to race with free_dev()
>
> Thanks,
> Sebastian
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 3.12.y: srcu warning during free_dev() with dm-multipath
2015-06-22 8:55 ` Sebastian Parschauer
@ 2015-06-22 10:25 ` Zdenek Kabelac
2015-06-22 12:22 ` Sebastian Parschauer
2015-06-22 14:21 ` Mike Snitzer
1 sibling, 1 reply; 5+ messages in thread
From: Zdenek Kabelac @ 2015-06-22 10:25 UTC (permalink / raw)
To: dm-devel
Dne 22.6.2015 v 10:55 Sebastian Parschauer napsal(a):
> Really, no response? Again?
>
> Why doesn't that surprise me if sending a bug report to Red Hat not
> using their kernel,... Thanks a lot! So this is how discrimination feels
> like,...
>
Have you checked your problem exists in the latest upstream kernel ?
If you are resolving issue with kernel 3.12 - it's likely a thing for your
distribution and its maintenance support?
Upstream (and this is upstream ml) is close to release 4.1 kernel - so you are
fixing couple years old issue? What kind of action do you expect ?
Regards
Zdenek
> On 17.06.2015 18:26, Sebastian Parschauer wrote:
>> Hi dm developers,
>>
>> we hit the srcu warning in cleanup_srcu_struct() when removing
>> dm-multipath devices. We run v3.12.40 but there is no change compared to
>> v3.12.44 in drivers/md/dm.c or kernel/srcu.c.
>>
>> Would the following upstream commit fix this?
>>
>> 63a4f06 dm: fix add_disk() NULL pointer due to race with free_dev()
>>
>> Thanks,
>> Sebastian
>
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 3.12.y: srcu warning during free_dev() with dm-multipath
2015-06-22 10:25 ` Zdenek Kabelac
@ 2015-06-22 12:22 ` Sebastian Parschauer
0 siblings, 0 replies; 5+ messages in thread
From: Sebastian Parschauer @ 2015-06-22 12:22 UTC (permalink / raw)
To: device-mapper development
On 22.06.2015 12:25, Zdenek Kabelac wrote:
> Dne 22.6.2015 v 10:55 Sebastian Parschauer napsal(a):
>> Really, no response? Again?
>>
>> Why doesn't that surprise me if sending a bug report to Red Hat not
>> using their kernel,... Thanks a lot! So this is how discrimination feels
>> like,...
>>
>
> Have you checked your problem exists in the latest upstream kernel ?
It likely doesn't. I can't roll out the latest greatest unstable
upstream kernel into a public IaaS cloud where I can see the problem
from time to time but in a test environment it's more difficult to
reproduce. We also have custom code running in different unrelated parts
which would have to be adapted.
> If you are resolving issue with kernel 3.12 - it's likely a thing for
> your distribution and its maintenance support?
We are our own distribution. So I'm just the distribution maintainer
asking the authors of a patch if that one could be related instead of
doing reverse engineering. Maintainers should have a feeling for their
code and also care for important fixes so that these find their way into
the stable trees - no matter which distribution.
Maybe that's the difference between homebrew FOSS and commercial FOSS.
But there are upstream kernel maintainers who do care for linux-stable.
I'd like to mention md maintainer Neil Brown here.
> Upstream (and this is upstream ml) is close to release 4.1 kernel - so
> you are fixing couple years old issue? What kind of action do you expect ?
Just a hint at which patches to look at.
You maintain a 3.10 kernel for RHEL 7. That one is also old and not
latest upstream.
I'll try the patch.
Cheers,
Sebastian
>> On 17.06.2015 18:26, Sebastian Parschauer wrote:
>>> Hi dm developers,
>>>
>>> we hit the srcu warning in cleanup_srcu_struct() when removing
>>> dm-multipath devices. We run v3.12.40 but there is no change compared to
>>> v3.12.44 in drivers/md/dm.c or kernel/srcu.c.
>>>
>>> Would the following upstream commit fix this?
>>>
>>> 63a4f06 dm: fix add_disk() NULL pointer due to race with free_dev()
>>>
>>> Thanks,
>>> Sebastian
>>
>
> --
> dm-devel mailing list
> dm-devel@redhat.com
> https://www.redhat.com/mailman/listinfo/dm-devel
^ permalink raw reply [flat|nested] 5+ messages in thread
* Re: 3.12.y: srcu warning during free_dev() with dm-multipath
2015-06-22 8:55 ` Sebastian Parschauer
2015-06-22 10:25 ` Zdenek Kabelac
@ 2015-06-22 14:21 ` Mike Snitzer
1 sibling, 0 replies; 5+ messages in thread
From: Mike Snitzer @ 2015-06-22 14:21 UTC (permalink / raw)
To: Sebastian Parschauer; +Cc: dm-devel, Alasdair Kergon
On Mon, Jun 22 2015 at 4:55am -0400,
Sebastian Parschauer <sebastian.riemer@profitbricks.com> wrote:
> Really, no response? Again?
>
> Why doesn't that surprise me if sending a bug report to Red Hat not
> using their kernel,... Thanks a lot! So this is how discrimination feels
> like,...
Do you think I'm more inclined to help you now?
Reason through the problem, revert or apply patches and test.
Take care.
^ permalink raw reply [flat|nested] 5+ messages in thread
end of thread, other threads:[~2015-06-22 14:21 UTC | newest]
Thread overview: 5+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-06-17 16:26 3.12.y: srcu warning during free_dev() with dm-multipath Sebastian Parschauer
2015-06-22 8:55 ` Sebastian Parschauer
2015-06-22 10:25 ` Zdenek Kabelac
2015-06-22 12:22 ` Sebastian Parschauer
2015-06-22 14:21 ` Mike Snitzer
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.