* Re: Linux 4.2.4
[not found] ` <20151025094634.GA6832@1wt.eu>
@ 2015-10-25 10:48 ` Gerhard Wiesinger
2015-10-25 16:29 ` Greg KH
2015-10-25 19:46 ` Jozsef Kadlecsik
0 siblings, 2 replies; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-10-25 10:48 UTC (permalink / raw)
To: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel
Cc: Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn, Jiri Slaby
On 25.10.2015 10:46, Willy Tarreau wrote:
> ipset *triggered* the problem. The whole stack dump would tell more.
OK, find the stack traces in the bug report:
https://bugzilla.redhat.com/show_bug.cgi?id=1272645
Kernel 4.1.10 triggered also a kernel dump when playing with ipset
commands and IPv6, details in the bug report ....
>> Kernel 4.2 seems to me not well tested in the netfilter parts at all
>> (Bug with already known bugfix
>> https://lists.debian.org/debian-kernel/2015/10/msg00034.html was
>> triggered on 2 of 3 of my machines, the new bug on 1 of 1 tested machine).
> There's a reason why Greg maintains stable and LTS kernels :-)
Stable kernels don't crash but definiton. :-)
At least triggered 2 kernel panics in 5min, even with 4.1.10 and ipset
commands ...
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 10:48 ` Linux 4.2.4 Gerhard Wiesinger
@ 2015-10-25 16:29 ` Greg KH
2015-10-25 17:14 ` Gerhard Wiesinger
2015-11-08 13:51 ` Gerhard Wiesinger
2015-10-25 19:46 ` Jozsef Kadlecsik
1 sibling, 2 replies; 14+ messages in thread
From: Greg KH @ 2015-10-25 16:29 UTC (permalink / raw)
To: Gerhard Wiesinger
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
linux-kernel, Andrew Morton, torvalds, stable, lwn, Jiri Slaby
On Sun, Oct 25, 2015 at 11:48:54AM +0100, Gerhard Wiesinger wrote:
> On 25.10.2015 10:46, Willy Tarreau wrote:
> >ipset *triggered* the problem. The whole stack dump would tell more.
>
> OK, find the stack traces in the bug report:
> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>
> Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
> and IPv6, details in the bug report ....
>
> >>Kernel 4.2 seems to me not well tested in the netfilter parts at all
> >>(Bug with already known bugfix
> >>https://lists.debian.org/debian-kernel/2015/10/msg00034.html was
> >>triggered on 2 of 3 of my machines, the new bug on 1 of 1 tested machine).
> >There's a reason why Greg maintains stable and LTS kernels :-)
>
> Stable kernels don't crash but definiton. :-)
>
> At least triggered 2 kernel panics in 5min, even with 4.1.10 and ipset
> commands ...
Does this happen also with Linus's tree? I suggest you ask the
networking developers about this on netdev@vger.kernel.org, there's
nothing that I can do on my own about this, sorry.
greg k-h
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 16:29 ` Greg KH
@ 2015-10-25 17:14 ` Gerhard Wiesinger
2015-11-08 13:51 ` Gerhard Wiesinger
1 sibling, 0 replies; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-10-25 17:14 UTC (permalink / raw)
To: Greg KH
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
linux-kernel, Andrew Morton, torvalds, stable, lwn, Jiri Slaby
On 25.10.2015 17:29, Greg KH wrote:
> On Sun, Oct 25, 2015 at 11:48:54AM +0100, Gerhard Wiesinger wrote:
>> On 25.10.2015 10:46, Willy Tarreau wrote:
>>> ipset *triggered* the problem. The whole stack dump would tell more.
>> OK, find the stack traces in the bug report:
>> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>>
>> Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
>> and IPv6, details in the bug report ....
>>
>>>> Kernel 4.2 seems to me not well tested in the netfilter parts at all
>>>> (Bug with already known bugfix
>>>> https://lists.debian.org/debian-kernel/2015/10/msg00034.html was
>>>> triggered on 2 of 3 of my machines, the new bug on 1 of 1 tested machine).
>>> There's a reason why Greg maintains stable and LTS kernels :-)
>> Stable kernels don't crash but definiton. :-)
>>
>> At least triggered 2 kernel panics in 5min, even with 4.1.10 and ipset
>> commands ...
> Does this happen also with Linus's tree? I suggest you ask the
> networking developers about this on netdev@vger.kernel.org, there's
> nothing that I can do on my own about this, sorry.
Already CCed netdev and netfilter-devel mailinglist. Need patches for
the switch driver of the banana Pi to get networking up but that patch
is stable. Maybe also some patches from the Fedora SRPMS are needed. But
I'm pretty sure that this also happens with plain vanilla kernel.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 10:48 ` Linux 4.2.4 Gerhard Wiesinger
2015-10-25 16:29 ` Greg KH
@ 2015-10-25 19:46 ` Jozsef Kadlecsik
2015-10-25 20:08 ` Gerhard Wiesinger
1 sibling, 1 reply; 14+ messages in thread
From: Jozsef Kadlecsik @ 2015-10-25 19:46 UTC (permalink / raw)
To: Gerhard Wiesinger
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
Hi,
On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
> On 25.10.2015 10:46, Willy Tarreau wrote:
> > ipset *triggered* the problem. The whole stack dump would tell more.
>
> OK, find the stack traces in the bug report:
> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>
> Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
> and IPv6, details in the bug report ....
It seems to me it is an architecture-specific alignment issue. I don't
have a Cortex-A7 ARM hardware and qemu doesn't seem to support it either,
so I'm unable to reproduce it (ipset passes all my tests on my hardware,
including more complex ones than what breaks here). My first wild guess is
that the dynamic array of the element structure is not aligned properly.
Could you give a try to the next patch?
diff --git a/net/netfilter/ipset/ip_set_hash_gen.h b/net/netfilter/ipset/ip_set_hash_gen.h
index afe905c..1cf357d 100644
--- a/net/netfilter/ipset/ip_set_hash_gen.h
+++ b/net/netfilter/ipset/ip_set_hash_gen.h
@@ -1211,6 +1211,9 @@ static const struct ip_set_type_variant mtype_variant = {
.same_set = mtype_same_set,
};
+#define IP_SET_BASE_ALIGN(dtype) \
+ ALIGN(sizeof(struct dtype), __alignof__(struct dtype))
+
#ifdef IP_SET_EMIT_CREATE
static int
IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
@@ -1319,12 +1322,12 @@ IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
#endif
set->variant = &IPSET_TOKEN(HTYPE, 4_variant);
set->dsize = ip_set_elem_len(set, tb,
- sizeof(struct IPSET_TOKEN(HTYPE, 4_elem)));
+ IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 4_elem)));
#ifndef IP_SET_PROTO_UNDEF
} else {
set->variant = &IPSET_TOKEN(HTYPE, 6_variant);
set->dsize = ip_set_elem_len(set, tb,
- sizeof(struct IPSET_TOKEN(HTYPE, 6_elem)));
+ IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 6_elem)));
}
#endif
if (tb[IPSET_ATTR_TIMEOUT]) {
If that does not solve it, then could you help to narrow down the issue?
Does the bug still appear if your remove the counter extension of the set?
Best regards,
Jozsef
-
E-mail : kadlec@blackhole.kfki.hu, kadlecsik.jozsef@wigner.mta.hu
PGP key : http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address : Wigner Research Centre for Physics, Hungarian Academy of Sciences
H-1525 Budapest 114, POB. 49, Hungary
^ permalink raw reply related [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 19:46 ` Jozsef Kadlecsik
@ 2015-10-25 20:08 ` Gerhard Wiesinger
2015-10-25 21:26 ` Gerhard Wiesinger
2015-10-26 8:58 ` Jozsef Kadlecsik
0 siblings, 2 replies; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-10-25 20:08 UTC (permalink / raw)
To: Jozsef Kadlecsik
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
On 25.10.2015 20:46, Jozsef Kadlecsik wrote:
> Hi,
>
> On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
>
>> On 25.10.2015 10:46, Willy Tarreau wrote:
>>> ipset *triggered* the problem. The whole stack dump would tell more.
>> OK, find the stack traces in the bug report:
>> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>>
>> Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
>> and IPv6, details in the bug report ....
> It seems to me it is an architecture-specific alignment issue. I don't
> have a Cortex-A7 ARM hardware and qemu doesn't seem to support it either,
> so I'm unable to reproduce it (ipset passes all my tests on my hardware,
> including more complex ones than what breaks here). My first wild guess is
> that the dynamic array of the element structure is not aligned properly.
> Could you give a try to the next patch?
>
> diff --git a/net/netfilter/ipset/ip_set_hash_gen.h b/net/netfilter/ipset/ip_set_hash_gen.h
> index afe905c..1cf357d 100644
> --- a/net/netfilter/ipset/ip_set_hash_gen.h
> +++ b/net/netfilter/ipset/ip_set_hash_gen.h
> @@ -1211,6 +1211,9 @@ static const struct ip_set_type_variant mtype_variant = {
> .same_set = mtype_same_set,
> };
>
> +#define IP_SET_BASE_ALIGN(dtype) \
> + ALIGN(sizeof(struct dtype), __alignof__(struct dtype))
> +
> #ifdef IP_SET_EMIT_CREATE
> static int
> IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
> @@ -1319,12 +1322,12 @@ IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
> #endif
> set->variant = &IPSET_TOKEN(HTYPE, 4_variant);
> set->dsize = ip_set_elem_len(set, tb,
> - sizeof(struct IPSET_TOKEN(HTYPE, 4_elem)));
> + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 4_elem)));
> #ifndef IP_SET_PROTO_UNDEF
> } else {
> set->variant = &IPSET_TOKEN(HTYPE, 6_variant);
> set->dsize = ip_set_elem_len(set, tb,
> - sizeof(struct IPSET_TOKEN(HTYPE, 6_elem)));
> + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 6_elem)));
> }
> #endif
> if (tb[IPSET_ATTR_TIMEOUT]) {
>
> If that does not solve it, then could you help to narrow down the issue?
> Does the bug still appear if your remove the counter extension of the set?
>
Hello Jozsef,
Patch applied well, compiling ...
Interesting, that it didn't happen before. Device is in production for
more than 2 month without any issue.
Also any idea regarding the second isssue? Or do you think it has the
same root cause?
Greetings from Vienna, Austria :-)
BTW: You can get the Banana Pi R1 for example at:
http://www.aliexpress.com/item/BPI-R1-Set-1-R1-Board-Clear-Case-5dB-Antenna-Power-Adapter-Banana-PI-R1-Smart/32362127917.html
I can really recommend it as a router. Power consumption is as less as
3W. Price is also IMHO very good.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 20:08 ` Gerhard Wiesinger
@ 2015-10-25 21:26 ` Gerhard Wiesinger
2015-10-25 21:53 ` Jozsef Kadlecsik
2015-10-26 8:58 ` Jozsef Kadlecsik
1 sibling, 1 reply; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-10-25 21:26 UTC (permalink / raw)
To: Jozsef Kadlecsik
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
On 25.10.2015 21:08, Gerhard Wiesinger wrote:
> On 25.10.2015 20:46, Jozsef Kadlecsik wrote:
>> Hi,
>>
>> On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
>>
>>> On 25.10.2015 10:46, Willy Tarreau wrote:
>>>> ipset *triggered* the problem. The whole stack dump would tell more.
>>> OK, find the stack traces in the bug report:
>>> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>>>
>>> Kernel 4.1.10 triggered also a kernel dump when playing with ipset
>>> commands
>>> and IPv6, details in the bug report ....
>> It seems to me it is an architecture-specific alignment issue. I don't
>> have a Cortex-A7 ARM hardware and qemu doesn't seem to support it
>> either,
>> so I'm unable to reproduce it (ipset passes all my tests on my hardware,
>> including more complex ones than what breaks here). My first wild
>> guess is
>> that the dynamic array of the element structure is not aligned properly.
>> Could you give a try to the next patch?
>>
>> diff --git a/net/netfilter/ipset/ip_set_hash_gen.h
>> b/net/netfilter/ipset/ip_set_hash_gen.h
>> index afe905c..1cf357d 100644
>> --- a/net/netfilter/ipset/ip_set_hash_gen.h
>> +++ b/net/netfilter/ipset/ip_set_hash_gen.h
>> @@ -1211,6 +1211,9 @@ static const struct ip_set_type_variant
>> mtype_variant = {
>> .same_set = mtype_same_set,
>> };
>> +#define IP_SET_BASE_ALIGN(dtype) \
>> + ALIGN(sizeof(struct dtype), __alignof__(struct dtype))
>> +
>> #ifdef IP_SET_EMIT_CREATE
>> static int
>> IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
>> @@ -1319,12 +1322,12 @@ IPSET_TOKEN(HTYPE, _create)(struct net *net,
>> struct ip_set *set,
>> #endif
>> set->variant = &IPSET_TOKEN(HTYPE, 4_variant);
>> set->dsize = ip_set_elem_len(set, tb,
>> - sizeof(struct IPSET_TOKEN(HTYPE, 4_elem)));
>> + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 4_elem)));
>> #ifndef IP_SET_PROTO_UNDEF
>> } else {
>> set->variant = &IPSET_TOKEN(HTYPE, 6_variant);
>> set->dsize = ip_set_elem_len(set, tb,
>> - sizeof(struct IPSET_TOKEN(HTYPE, 6_elem)));
>> + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 6_elem)));
>> }
>> #endif
>> if (tb[IPSET_ATTR_TIMEOUT]) {
>>
>> If that does not solve it, then could you help to narrow down the issue?
>> Does the bug still appear if your remove the counter extension of the
>> set?
>>
>
> Hello Jozsef,
>
> Patch applied well, compiling ...
Hello Jozsef,
Thank you for the patch it but still crashes, see:
https://bugzilla.redhat.com/show_bug.cgi?id=1272645
Any further ideas?
Thank you.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 21:26 ` Gerhard Wiesinger
@ 2015-10-25 21:53 ` Jozsef Kadlecsik
2015-10-26 7:27 ` Gerhard Wiesinger
0 siblings, 1 reply; 14+ messages in thread
From: Jozsef Kadlecsik @ 2015-10-25 21:53 UTC (permalink / raw)
To: Gerhard Wiesinger
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
> On 25.10.2015 21:08, Gerhard Wiesinger wrote:
> > On 25.10.2015 20:46, Jozsef Kadlecsik wrote:
> > > Hi,
> > >
> > > On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
> > >
> > > > On 25.10.2015 10:46, Willy Tarreau wrote:
> > > > > ipset *triggered* the problem. The whole stack dump would tell more.
> > > > OK, find the stack traces in the bug report:
> > > > https://bugzilla.redhat.com/show_bug.cgi?id=1272645
> > > >
> > > > Kernel 4.1.10 triggered also a kernel dump when playing with ipset
> > > > commands
> > > > and IPv6, details in the bug report ....
> > > It seems to me it is an architecture-specific alignment issue. I don't
> > > have a Cortex-A7 ARM hardware and qemu doesn't seem to support it either,
> > > so I'm unable to reproduce it (ipset passes all my tests on my hardware,
> > > including more complex ones than what breaks here). My first wild guess is
> > > that the dynamic array of the element structure is not aligned properly.
> > > Could you give a try to the next patch?
> > >
> > > diff --git a/net/netfilter/ipset/ip_set_hash_gen.h
> > > b/net/netfilter/ipset/ip_set_hash_gen.h
> > > index afe905c..1cf357d 100644
> > > --- a/net/netfilter/ipset/ip_set_hash_gen.h
> > > +++ b/net/netfilter/ipset/ip_set_hash_gen.h
> > > @@ -1211,6 +1211,9 @@ static const struct ip_set_type_variant
> > > mtype_variant = {
> > > .same_set = mtype_same_set,
> > > };
> > > +#define IP_SET_BASE_ALIGN(dtype) \
> > > + ALIGN(sizeof(struct dtype), __alignof__(struct dtype))
> > > +
> > > #ifdef IP_SET_EMIT_CREATE
> > > static int
> > > IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
> > > @@ -1319,12 +1322,12 @@ IPSET_TOKEN(HTYPE, _create)(struct net *net,
> > > struct ip_set *set,
> > > #endif
> > > set->variant = &IPSET_TOKEN(HTYPE, 4_variant);
> > > set->dsize = ip_set_elem_len(set, tb,
> > > - sizeof(struct IPSET_TOKEN(HTYPE, 4_elem)));
> > > + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 4_elem)));
> > > #ifndef IP_SET_PROTO_UNDEF
> > > } else {
> > > set->variant = &IPSET_TOKEN(HTYPE, 6_variant);
> > > set->dsize = ip_set_elem_len(set, tb,
> > > - sizeof(struct IPSET_TOKEN(HTYPE, 6_elem)));
> > > + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE, 6_elem)));
> > > }
> > > #endif
> > > if (tb[IPSET_ATTR_TIMEOUT]) {
> > >
> > > If that does not solve it, then could you help to narrow down the issue?
> > > Does the bug still appear if your remove the counter extension of the set?
> > >
>
> Thank you for the patch it but still crashes, see:
> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>
> Any further ideas?
Does it crash without counters? That could narrow down where to look for.
Best regards,
Jozsef
-
E-mail : kadlec@blackhole.kfki.hu, kadlecsik.jozsef@wigner.mta.hu
PGP key : http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address : Wigner Research Centre for Physics, Hungarian Academy of Sciences
H-1525 Budapest 114, POB. 49, Hungary
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 21:53 ` Jozsef Kadlecsik
@ 2015-10-26 7:27 ` Gerhard Wiesinger
0 siblings, 0 replies; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-10-26 7:27 UTC (permalink / raw)
To: Jozsef Kadlecsik
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
On 25.10.2015 22:53, Jozsef Kadlecsik wrote:
> On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
>
>> Any further ideas?
> Does it crash without counters? That could narrow down where to look for.
>
>
Hello Jozsef,
it doesn't crash i I don't use the counters so far. So there must be a
bug with the counters.
Any idea for the root cause?
Thnx.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 20:08 ` Gerhard Wiesinger
2015-10-25 21:26 ` Gerhard Wiesinger
@ 2015-10-26 8:58 ` Jozsef Kadlecsik
2015-10-26 9:11 ` Gerhard Wiesinger
1 sibling, 1 reply; 14+ messages in thread
From: Jozsef Kadlecsik @ 2015-10-26 8:58 UTC (permalink / raw)
To: Gerhard Wiesinger
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
> On 25.10.2015 20:46, Jozsef Kadlecsik wrote:
> > Hi,
> >
> > On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
> >
> > > On 25.10.2015 10:46, Willy Tarreau wrote:
> > > > ipset *triggered* the problem. The whole stack dump would tell more.
> > > OK, find the stack traces in the bug report:
> > > https://bugzilla.redhat.com/show_bug.cgi?id=1272645
> > >
> > > Kernel 4.1.10 triggered also a kernel dump when playing with ipset
> > > commands
> > > and IPv6, details in the bug report ....
> > It seems to me it is an architecture-specific alignment issue. I don't
> > have a Cortex-A7 ARM hardware and qemu doesn't seem to support it either,
> > so I'm unable to reproduce it (ipset passes all my tests on my hardware,
> > including more complex ones than what breaks here). My first wild guess is
> > that the dynamic array of the element structure is not aligned properly.
> > Could you give a try to the next patch?
> >
> > diff --git a/net/netfilter/ipset/ip_set_hash_gen.h
> > b/net/netfilter/ipset/ip_set_hash_gen.h
> > index afe905c..1cf357d 100644
> > --- a/net/netfilter/ipset/ip_set_hash_gen.h
> > +++ b/net/netfilter/ipset/ip_set_hash_gen.h
> > @@ -1211,6 +1211,9 @@ static const struct ip_set_type_variant mtype_variant
> > = {
> > .same_set = mtype_same_set,
> > };
> > +#define IP_SET_BASE_ALIGN(dtype) \
> > + ALIGN(sizeof(struct dtype), __alignof__(struct dtype))
> > +
> > #ifdef IP_SET_EMIT_CREATE
> > static int
> > IPSET_TOKEN(HTYPE, _create)(struct net *net, struct ip_set *set,
> > @@ -1319,12 +1322,12 @@ IPSET_TOKEN(HTYPE, _create)(struct net *net, struct
> > ip_set *set,
> > #endif
> > set->variant = &IPSET_TOKEN(HTYPE, 4_variant);
> > set->dsize = ip_set_elem_len(set, tb,
> > - sizeof(struct IPSET_TOKEN(HTYPE, 4_elem)));
> > + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE,
> > 4_elem)));
> > #ifndef IP_SET_PROTO_UNDEF
> > } else {
> > set->variant = &IPSET_TOKEN(HTYPE, 6_variant);
> > set->dsize = ip_set_elem_len(set, tb,
> > - sizeof(struct IPSET_TOKEN(HTYPE, 6_elem)));
> > + IP_SET_BASE_ALIGN(IPSET_TOKEN(HTYPE,
> > 6_elem)));
> > }
> > #endif
> > if (tb[IPSET_ATTR_TIMEOUT]) {
> >
> > If that does not solve it, then could you help to narrow down the issue?
> > Does the bug still appear if your remove the counter extension of the set?
> >
>
> Patch applied well, compiling ...
>
> Interesting, that it didn't happen before. Device is in production for
> more than 2 month without any issue.
You mean the device was stable with the earlier kernels, but starting with
4.2.3 (and back to 4.1.10) you have got problems, don't you?
> Also any idea regarding the second isssue? Or do you think it has the
> same root cause?
Looking at your RedHat bugzilla report, the "nf_conntrack: table full,
dropping packet" and "Alignment trap: not handling instruction" are two
unrelated issues and the second one is triggered by the unaligned counter
extension acccess in ipset, I'm investigating. I can't think of any reason
how those issues could be related to each other.
> Greetings from Vienna, Austria :-)
Quite near to my place :-)
> BTW: You can get the Banana Pi R1 for example at:
> http://www.aliexpress.com/item/BPI-R1-Set-1-R1-Board-Clear-Case-5dB-Antenna-Power-Adapter-Banana-PI-R1-Smart/32362127917.html
> I can really recommend it as a router. Power consumption is as less as 3W.
> Price is also IMHO very good.
Cool mini gear, indeed!
Best regards,
Jozsef
-
E-mail : kadlec@blackhole.kfki.hu, kadlecsik.jozsef@wigner.mta.hu
PGP key : http://www.kfki.hu/~kadlec/pgp_public_key.txt
Address : Wigner Research Centre for Physics, Hungarian Academy of Sciences
H-1525 Budapest 114, POB. 49, Hungary
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-26 8:58 ` Jozsef Kadlecsik
@ 2015-10-26 9:11 ` Gerhard Wiesinger
0 siblings, 0 replies; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-10-26 9:11 UTC (permalink / raw)
To: Jozsef Kadlecsik
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
Greg KH, linux-kernel, Andrew Morton, torvalds, stable, lwn,
Jiri Slaby
On 26.10.2015 09:58, Jozsef Kadlecsik wrote:
> On Sun, 25 Oct 2015, Gerhard Wiesinger wrote:
>
>> Also any idea regarding the second isssue? Or do you think it has the
>> same root cause?
> Looking at your RedHat bugzilla report, the "nf_conntrack: table full,
> dropping packet" and "Alignment trap: not handling instruction" are two
> unrelated issues and the second one is triggered by the unaligned counter
> extension acccess in ipset, I'm investigating. I can't think of any reason
> how those issues could be related to each other.
Yes, they are unrelated.
Issue 1: nf_conntrack: table full, dropping packet => Fixed with 4.2.4
Issue 2: Alignment trap: not handling instruction => Happens when ipset
counters are enabled
Please keep in mind it happens with IPv6 commands.
Currently 4.2.4 without ipset counters runs well.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-10-25 16:29 ` Greg KH
2015-10-25 17:14 ` Gerhard Wiesinger
@ 2015-11-08 13:51 ` Gerhard Wiesinger
2015-11-08 17:20 ` Greg KH
1 sibling, 1 reply; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-11-08 13:51 UTC (permalink / raw)
To: Greg KH
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
linux-kernel, Andrew Morton, torvalds, stable, lwn, Jiri Slaby
On 25.10.2015 17:29, Greg KH wrote:
> On Sun, Oct 25, 2015 at 11:48:54AM +0100, Gerhard Wiesinger wrote:
>> On 25.10.2015 10:46, Willy Tarreau wrote:
>>> ipset *triggered* the problem. The whole stack dump would tell more.
>> OK, find the stack traces in the bug report:
>> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>>
>> Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
>> and IPv6, details in the bug report ....
>>
>>>> Kernel 4.2 seems to me not well tested in the netfilter parts at all
>>>> (Bug with already known bugfix
>>>> https://lists.debian.org/debian-kernel/2015/10/msg00034.html was
>>>> triggered on 2 of 3 of my machines, the new bug on 1 of 1 tested machine).
>>> There's a reason why Greg maintains stable and LTS kernels :-)
>> Stable kernels don't crash but definiton. :-)
>>
>> At least triggered 2 kernel panics in 5min, even with 4.1.10 and ipset
>> commands ...
> Does this happen also with Linus's tree? I suggest you ask the
> networking developers about this on netdev@vger.kernel.org, there's
> nothing that I can do on my own about this, sorry.
Patch is now available, see:
[PATCH 0/3] ipset patches for nf
https://marc.info/?l=netfilter-devel&m=144690007708041&w=2
https://marc.info/?l=netfilter-devel&m=144690007808042&w=2
https://marc.info/?l=netfilter-devel&m=144690008608043&w=2
https://marc.info/?l=netfilter-devel&m=144690007708039&w=2
[ANNOUNCE] ipset 6.27 released
https://marc.info/?l=netfilter-devel&m=144690048308099&w=2
Requires also new userland ipset version.
Please integrate it upstream.
Thanx to Jozsef Kadlecsik for fixing it.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-11-08 13:51 ` Gerhard Wiesinger
@ 2015-11-08 17:20 ` Greg KH
2015-11-09 12:35 ` Gerhard Wiesinger
0 siblings, 1 reply; 14+ messages in thread
From: Greg KH @ 2015-11-08 17:20 UTC (permalink / raw)
To: Gerhard Wiesinger
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
linux-kernel, Andrew Morton, torvalds, stable, lwn, Jiri Slaby
On Sun, Nov 08, 2015 at 02:51:01PM +0100, Gerhard Wiesinger wrote:
> On 25.10.2015 17:29, Greg KH wrote:
> >On Sun, Oct 25, 2015 at 11:48:54AM +0100, Gerhard Wiesinger wrote:
> >>On 25.10.2015 10:46, Willy Tarreau wrote:
> >>>ipset *triggered* the problem. The whole stack dump would tell more.
> >>OK, find the stack traces in the bug report:
> >>https://bugzilla.redhat.com/show_bug.cgi?id=1272645
> >>
> >>Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
> >>and IPv6, details in the bug report ....
> >>
> >>>>Kernel 4.2 seems to me not well tested in the netfilter parts at all
> >>>>(Bug with already known bugfix
> >>>>https://lists.debian.org/debian-kernel/2015/10/msg00034.html was
> >>>>triggered on 2 of 3 of my machines, the new bug on 1 of 1 tested machine).
> >>>There's a reason why Greg maintains stable and LTS kernels :-)
> >>Stable kernels don't crash but definiton. :-)
> >>
> >>At least triggered 2 kernel panics in 5min, even with 4.1.10 and ipset
> >>commands ...
> >Does this happen also with Linus's tree? I suggest you ask the
> >networking developers about this on netdev@vger.kernel.org, there's
> >nothing that I can do on my own about this, sorry.
>
> Patch is now available, see:
> [PATCH 0/3] ipset patches for nf
> https://marc.info/?l=netfilter-devel&m=144690007708041&w=2
> https://marc.info/?l=netfilter-devel&m=144690007808042&w=2
> https://marc.info/?l=netfilter-devel&m=144690008608043&w=2
> https://marc.info/?l=netfilter-devel&m=144690007708039&w=2
> [ANNOUNCE] ipset 6.27 released
> https://marc.info/?l=netfilter-devel&m=144690048308099&w=2
>
> Requires also new userland ipset version.
>
> Please integrate it upstream.
>
> Thanx to Jozsef Kadlecsik for fixing it.
That's great, can you let me know the git commits that end up in Linus's
tree? That's what we need for the stable kernel.
thanks,
greg k-h
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-11-08 17:20 ` Greg KH
@ 2015-11-09 12:35 ` Gerhard Wiesinger
2015-11-09 12:41 ` Pablo Neira Ayuso
0 siblings, 1 reply; 14+ messages in thread
From: Gerhard Wiesinger @ 2015-11-09 12:35 UTC (permalink / raw)
To: Greg KH
Cc: Willy Tarreau, Linux Kernel Network Developers, netfilter-devel,
linux-kernel, Andrew Morton, torvalds, stable, lwn, Jiri Slaby
On 08.11.2015 18:20, Greg KH wrote:
> On Sun, Nov 08, 2015 at 02:51:01PM +0100, Gerhard Wiesinger wrote:
>> On 25.10.2015 17:29, Greg KH wrote:
>>> On Sun, Oct 25, 2015 at 11:48:54AM +0100, Gerhard Wiesinger wrote:
>>>> On 25.10.2015 10:46, Willy Tarreau wrote:
>>>>> ipset *triggered* the problem. The whole stack dump would tell more.
>>>> OK, find the stack traces in the bug report:
>>>> https://bugzilla.redhat.com/show_bug.cgi?id=1272645
>>>>
>>>> Kernel 4.1.10 triggered also a kernel dump when playing with ipset commands
>>>> and IPv6, details in the bug report ....
>>>>
>>>>>> Kernel 4.2 seems to me not well tested in the netfilter parts at all
>>>>>> (Bug with already known bugfix
>>>>>> https://lists.debian.org/debian-kernel/2015/10/msg00034.html was
>>>>>> triggered on 2 of 3 of my machines, the new bug on 1 of 1 tested machine).
>>>>> There's a reason why Greg maintains stable and LTS kernels :-)
>>>> Stable kernels don't crash but definiton. :-)
>>>>
>>>> At least triggered 2 kernel panics in 5min, even with 4.1.10 and ipset
>>>> commands ...
>>> Does this happen also with Linus's tree? I suggest you ask the
>>> networking developers about this on netdev@vger.kernel.org, there's
>>> nothing that I can do on my own about this, sorry.
>> Patch is now available, see:
>> [PATCH 0/3] ipset patches for nf
>> https://marc.info/?l=netfilter-devel&m=144690007708041&w=2
>> https://marc.info/?l=netfilter-devel&m=144690007808042&w=2
>> https://marc.info/?l=netfilter-devel&m=144690008608043&w=2
>> https://marc.info/?l=netfilter-devel&m=144690007708039&w=2
>> [ANNOUNCE] ipset 6.27 released
>> https://marc.info/?l=netfilter-devel&m=144690048308099&w=2
>>
>> Requires also new userland ipset version.
>>
>> Please integrate it upstream.
>>
>> Thanx to Jozsef Kadlecsik for fixing it.
> That's great, can you let me know the git commits that end up in Linus's
> tree? That's what we need for the stable kernel.
Find the commits here:
https://git.kernel.org/cgit/linux/kernel/git/pablo/nf.git/
https://git.kernel.org/cgit/linux/kernel/git/pablo/nf.git/commit/?id=e75cb467df29a428612c162e6f1451c5c0717091
Don't know exactly the merging processes, so feel free to merge or
contact Pablo.
Ciao,
Gerhard
^ permalink raw reply [flat|nested] 14+ messages in thread
* Re: Linux 4.2.4
2015-11-09 12:35 ` Gerhard Wiesinger
@ 2015-11-09 12:41 ` Pablo Neira Ayuso
0 siblings, 0 replies; 14+ messages in thread
From: Pablo Neira Ayuso @ 2015-11-09 12:41 UTC (permalink / raw)
To: Gerhard Wiesinger
Cc: Greg KH, Willy Tarreau, Linux Kernel Network Developers,
netfilter-devel, linux-kernel, Andrew Morton, torvalds, stable,
lwn, Jiri Slaby
On Mon, Nov 09, 2015 at 01:35:11PM +0100, Gerhard Wiesinger wrote:
> On 08.11.2015 18:20, Greg KH wrote:
> >That's great, can you let me know the git commits that end up in Linus's
> >tree? That's what we need for the stable kernel.
>
> Find the commits here:
> https://git.kernel.org/cgit/linux/kernel/git/pablo/nf.git/
> https://git.kernel.org/cgit/linux/kernel/git/pablo/nf.git/commit/?id=e75cb467df29a428612c162e6f1451c5c0717091
>
> Don't know exactly the merging processes, so feel free to merge or contact
> Pablo.
I'll take care of that, this is already following its path to -stable,
will take a little while though as usual.
^ permalink raw reply [flat|nested] 14+ messages in thread
end of thread, other threads:[~2015-11-09 12:41 UTC | newest]
Thread overview: 14+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
[not found] <20151023003306.GA25388@kroah.com>
[not found] ` <562C83FD.6010402@wiesinger.com>
[not found] ` <20151025082547.GA22300@kroah.com>
[not found] ` <20151025090308.GA17656@1wt.eu>
[not found] ` <562CA13C.90500@wiesinger.com>
[not found] ` <20151025094634.GA6832@1wt.eu>
2015-10-25 10:48 ` Linux 4.2.4 Gerhard Wiesinger
2015-10-25 16:29 ` Greg KH
2015-10-25 17:14 ` Gerhard Wiesinger
2015-11-08 13:51 ` Gerhard Wiesinger
2015-11-08 17:20 ` Greg KH
2015-11-09 12:35 ` Gerhard Wiesinger
2015-11-09 12:41 ` Pablo Neira Ayuso
2015-10-25 19:46 ` Jozsef Kadlecsik
2015-10-25 20:08 ` Gerhard Wiesinger
2015-10-25 21:26 ` Gerhard Wiesinger
2015-10-25 21:53 ` Jozsef Kadlecsik
2015-10-26 7:27 ` Gerhard Wiesinger
2015-10-26 8:58 ` Jozsef Kadlecsik
2015-10-26 9:11 ` Gerhard Wiesinger
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).