All of lore.kernel.org
 help / color / mirror / Atom feed
From: Herbert Xu <herbert@gondor.hengli.com.au>
To: Eric Dumazet <eric.dumazet@gmail.com>
Cc: Jarek Poplawski <jarkao2@gmail.com>,
	Plamen Petrov <pvp-lsts@fs.uni-ruse.bg>,
	"Rafael J. Wysocki" <rjw@sisk.pl>,
	Kernel Testers List <kernel-testers@vger.kernel.org>,
	Maciej Rutecki <maciej.rutecki@gmail.com>,
	"David S. Miller" <davem@davemloft.net>,
	Linux Kernel Mailing List <linux-kernel@vger.kernel.org>,
	netdev@vger.kernel.org
Subject: Re: [Bug #16626] Machine hangs with EIP at skb_copy_and_csum_dev
Date: Fri, 3 Sep 2010 16:30:28 +0800	[thread overview]
Message-ID: <20100903083028.GB28952@gondor.apana.org.au> (raw)
In-Reply-To: <1283338251.2556.124.camel@edumazet-laptop>

On Wed, Sep 01, 2010 at 12:50:51PM +0200, Eric Dumazet wrote:
> Plamen, could you test following patch ?
> 
> I reproduced problem on a dev machine and following patch cured it.
> 
> Thanks
> 
> [PATCH] gro: fix different skb headrooms
> 
> packets entering GRO might have different headrooms, even for a given
> flow (because of implementation details in drivers, like copybreak).
> We cant force drivers to deliver packets with a fixed headroom.
> 
> 1) fix skb_segment()
> 
> skb_segment() makes the false assumption headrooms of fragments are same
> than the head. When CHECKSUM_PARTIAL is used, this can give csum_start
> errors, and crash later in skb_copy_and_csum_dev()
> 
> 2) allocate a minimal skb for head of frag_list
> 
> skb_gro_receive() uses netdev_alloc_skb(headroom + skb_gro_offset(p)) to
> allocate a fresh skb. This adds NET_SKB_PAD to a padding already
> provided by netdevice, depending on various things, like copybreak.
> 
> Use alloc_skb() to allocate an exact padding, to reduce cache line
> needs:
> NET_SKB_PAD + NET_IP_ALIGN
> 
> bugzilla : https://bugzilla.kernel.org/show_bug.cgi?id=16626
> 
> Many thanks to Plamen Petrov, testing many debugging patches !
> With help of Jarek Poplawski.
> 
> Reported-by: Plamen Petrov <pvp-lsts@fs.uni-ruse.bg>
> Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com>
> CC: Jarek Poplawski <jarkao2@gmail.com>

Thanks for diagnosing and fixing this!

> diff --git a/net/core/skbuff.c b/net/core/skbuff.c
> index 3a2513f..26396ff 100644
> --- a/net/core/skbuff.c
> +++ b/net/core/skbuff.c
> @@ -2573,6 +2573,10 @@ struct sk_buff *skb_segment(struct sk_buff *skb, int features)
>  		__copy_skb_header(nskb, skb);
>  		nskb->mac_len = skb->mac_len;
>  
> +		/* nskb and skb might have different headroom */
> +		if (nskb->ip_summed == CHECKSUM_PARTIAL)
> +			nskb->csum_start += skb_headroom(nskb) - headroom;

This test is redundant since we require CHECKSUM_PARTIAL for
GSO packets.

Cheers,
-- 
Email: Herbert Xu <herbert@gondor.apana.org.au>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt

WARNING: multiple messages have this Message-ID (diff)
From: Herbert Xu <herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org>
To: Eric Dumazet <eric.dumazet-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
Cc: Jarek Poplawski <jarkao2-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	Plamen Petrov <pvp-lsts-s6OjJRe3oxUfI6EYonfoRA@public.gmane.org>,
	"Rafael J. Wysocki" <rjw-KKrjLPT3xs0@public.gmane.org>,
	Kernel Testers List
	<kernel-testers-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
	Maciej Rutecki
	<maciej.rutecki-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>,
	"David S. Miller" <davem-fT/PcQaiUtIeIZ0/mPfg9Q@public.gmane.org>,
	Linux Kernel Mailing List
	<linux-kernel-u79uwXL29TY76Z2rM5mHXA@public.gmane.org>,
	netdev-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Re: [Bug #16626] Machine hangs with EIP at skb_copy_and_csum_dev
Date: Fri, 3 Sep 2010 16:30:28 +0800	[thread overview]
Message-ID: <20100903083028.GB28952@gondor.apana.org.au> (raw)
In-Reply-To: <1283338251.2556.124.camel@edumazet-laptop>

On Wed, Sep 01, 2010 at 12:50:51PM +0200, Eric Dumazet wrote:
> Plamen, could you test following patch ?
> 
> I reproduced problem on a dev machine and following patch cured it.
> 
> Thanks
> 
> [PATCH] gro: fix different skb headrooms
> 
> packets entering GRO might have different headrooms, even for a given
> flow (because of implementation details in drivers, like copybreak).
> We cant force drivers to deliver packets with a fixed headroom.
> 
> 1) fix skb_segment()
> 
> skb_segment() makes the false assumption headrooms of fragments are same
> than the head. When CHECKSUM_PARTIAL is used, this can give csum_start
> errors, and crash later in skb_copy_and_csum_dev()
> 
> 2) allocate a minimal skb for head of frag_list
> 
> skb_gro_receive() uses netdev_alloc_skb(headroom + skb_gro_offset(p)) to
> allocate a fresh skb. This adds NET_SKB_PAD to a padding already
> provided by netdevice, depending on various things, like copybreak.
> 
> Use alloc_skb() to allocate an exact padding, to reduce cache line
> needs:
> NET_SKB_PAD + NET_IP_ALIGN
> 
> bugzilla : https://bugzilla.kernel.org/show_bug.cgi?id=16626
> 
> Many thanks to Plamen Petrov, testing many debugging patches !
> With help of Jarek Poplawski.
> 
> Reported-by: Plamen Petrov <pvp-lsts-s6OjJRe3oxUfI6EYonfoRA@public.gmane.org>
> Signed-off-by: Eric Dumazet <eric.dumazet-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
> CC: Jarek Poplawski <jarkao2-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>

Thanks for diagnosing and fixing this!

> diff --git a/net/core/skbuff.c b/net/core/skbuff.c
> index 3a2513f..26396ff 100644
> --- a/net/core/skbuff.c
> +++ b/net/core/skbuff.c
> @@ -2573,6 +2573,10 @@ struct sk_buff *skb_segment(struct sk_buff *skb, int features)
>  		__copy_skb_header(nskb, skb);
>  		nskb->mac_len = skb->mac_len;
>  
> +		/* nskb and skb might have different headroom */
> +		if (nskb->ip_summed == CHECKSUM_PARTIAL)
> +			nskb->csum_start += skb_headroom(nskb) - headroom;

This test is redundant since we require CHECKSUM_PARTIAL for
GSO packets.

Cheers,
-- 
Email: Herbert Xu <herbert-lOAM2aK0SrRLBo1qDEOMRrpzq4S04n8Q@public.gmane.org>
Home Page: http://gondor.apana.org.au/~herbert/
PGP Key: http://gondor.apana.org.au/~herbert/pubkey.txt

  parent reply	other threads:[~2010-09-03  8:30 UTC|newest]

Thread overview: 90+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2010-08-29 22:24 2.6.36-rc3: Reported regressions from 2.6.35 Rafael J. Wysocki
2010-08-29 22:24 ` [Bug #16626] Machine hangs with EIP at skb_copy_and_csum_dev Rafael J. Wysocki
     [not found]   ` <courier.4C7C99F2.00001F74@fs.ru.acad.bg>
2010-08-31 19:26     ` Jarek Poplawski
2010-08-31 19:26       ` Jarek Poplawski
2010-09-01 10:50       ` Eric Dumazet
2010-09-01 10:50         ` Eric Dumazet
2010-09-01 11:20         ` Jarek Poplawski
2010-09-01 11:20           ` Jarek Poplawski
2010-09-01 13:57           ` Eric Dumazet
2010-09-01 13:57             ` Eric Dumazet
2010-09-01 15:05             ` Jarek Poplawski
2010-09-01 15:05               ` Jarek Poplawski
2010-09-02  1:23         ` David Miller
2010-09-02  1:23           ` David Miller
2010-09-03  8:00         ` Plamen Petrov
2010-09-03  8:00           ` Plamen Petrov
2010-09-03  9:06           ` Jarek Poplawski
2010-09-03  9:06             ` Jarek Poplawski
2010-09-03  8:30         ` Herbert Xu [this message]
2010-09-03  8:30           ` Herbert Xu
2010-09-04 20:34         ` Jarek Poplawski
2010-09-04 20:34           ` Jarek Poplawski
2010-09-05  7:49           ` Eric Dumazet
2010-09-05  7:49             ` Eric Dumazet
2010-09-08  4:57             ` Plamen Petrov
2010-09-08  4:57               ` Plamen Petrov
2010-09-08  6:20               ` Jarek Poplawski
2010-09-08  6:20                 ` Jarek Poplawski
2010-09-08 17:32                 ` David Miller
2010-09-08 17:32                   ` David Miller
2010-09-08 20:21                   ` Rafael J. Wysocki
2010-09-08 20:21                     ` Rafael J. Wysocki
2010-09-12  6:55                     ` Plamen Petrov
2010-09-12  6:55                       ` Plamen Petrov
2010-09-12 15:55                       ` David Miller
2010-09-12 15:55                         ` David Miller
2010-09-13  6:49                         ` Plamen Petrov
2010-09-13  6:49                           ` Plamen Petrov
2010-09-12 17:28                       ` Rafael J. Wysocki
2010-09-12 17:28                         ` Rafael J. Wysocki
2018-08-29 21:39       ` Mitchell Erblich
2010-08-29 22:36 ` [Bug #16971] qla4xxx compile failure on 32-bit PowerPC: missing readq and writeq Rafael J. Wysocki
2010-08-30  8:45   ` Meelis Roos
2010-08-30 13:46     ` Florian Mickler
2010-08-30 13:46       ` Florian Mickler
2010-08-29 22:36 ` [Bug #16629] fix BUG: using smp_processor_id() in preemptible code (resend) Rafael J. Wysocki
2010-08-29 22:36   ` Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17021] [REGRESSION] [2.6.36-rc1] [DRM INTEL] [drm:intel_calculate_wm] *ERROR* Insufficient FIFO for plane, expect flickering: entries required = 36, available = 28 Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #16961] kernel BUG at arch/x86/kvm/../../../virt/kvm/kvm_main.c:1978 Rafael J. Wysocki
2010-08-29 22:36   ` Rafael J. Wysocki
2010-08-30  8:55   ` Sergey Senozhatsky
2010-08-30  8:55     ` Sergey Senozhatsky
2010-08-30 13:44     ` Florian Mickler
2010-08-30 13:44       ` Florian Mickler
2010-08-30 17:38     ` Rafael J. Wysocki
2010-08-30 17:38       ` Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #16881] [REGRESSION, Radeon-KMS] 2.6.36-rc1,2 - missing textures in 0 A.D Rafael J. Wysocki
2010-08-29 22:36   ` Rafael J. Wysocki
2010-08-30  9:54   ` trapDoor
2010-08-30  9:54     ` trapDoor
2010-08-30 13:39     ` Florian Mickler
2010-08-30 13:39       ` Florian Mickler
2010-08-30 14:45       ` trapDoor
2010-08-30 14:45         ` trapDoor
2010-08-29 22:36 ` [Bug #16951] hackbench regression with 2.6.36-rc1 Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17131] WARN with 3c905 boomerang NIC Rafael J. Wysocki
2010-09-06  5:50   ` Florian Mickler
2010-09-06  5:50     ` Florian Mickler
2010-09-06 10:30     ` Doug Nazar
2010-09-06 10:30       ` Doug Nazar
2010-09-06 11:00       ` Florian Mickler
2010-09-06 11:00         ` Florian Mickler
2010-08-29 22:36 ` [Bug #17041] 2.6.36-rc1 hangs during XFS barrier test for / Rafael J. Wysocki
2010-08-30  4:36   ` Torsten Kaiser
2010-08-30  4:36     ` Torsten Kaiser
2010-08-30  9:27     ` Florian Mickler
2010-08-30  9:27       ` Florian Mickler
2010-08-30 17:40     ` Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17151] i915: 2.6.36-rc2 hoses my Intel display Rafael J. Wysocki
2010-08-30 18:59   ` Jonathan Corbet
2010-08-30 20:42     ` Rafael J. Wysocki
2010-08-30 20:42       ` Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17061] 2.6.36-rc1 on zaurus: bluetooth regression Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17301] i915: 2.6.36-rc2 wrong resolution on gdm start Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17321] i386 WARNING: at kernel/softirq.c:143 _local_bh_enable_ip+0x2f/0x7e Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17311] 2.6.36-rc2-git4 - INFO: possible circular locking dependency detected Rafael J. Wysocki
2010-08-29 22:36   ` Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17341] kdump regression compared to v2.6.35 Rafael J. Wysocki
2010-08-29 22:36   ` Rafael J. Wysocki
2010-08-29 22:36 ` [Bug #17331] BUG: scheduling while atomic Rafael J. Wysocki

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20100903083028.GB28952@gondor.apana.org.au \
    --to=herbert@gondor.hengli.com.au \
    --cc=davem@davemloft.net \
    --cc=eric.dumazet@gmail.com \
    --cc=jarkao2@gmail.com \
    --cc=kernel-testers@vger.kernel.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=maciej.rutecki@gmail.com \
    --cc=netdev@vger.kernel.org \
    --cc=pvp-lsts@fs.uni-ruse.bg \
    --cc=rjw@sisk.pl \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.