From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751696AbeERSpG (ORCPT ); Fri, 18 May 2018 14:45:06 -0400 Received: from mail-ua0-f195.google.com ([209.85.217.195]:45279 "EHLO mail-ua0-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751156AbeERSpE (ORCPT ); Fri, 18 May 2018 14:45:04 -0400 X-Google-Smtp-Source: AB8JxZq/ByIP4CZIObgaXwazX701Zsp16eE+hlcHuU1pLWuvWvefn5BF9uWTulp4Ycu1eGOueKDEff0LZ2G9YUg3x8I= MIME-Version: 1.0 In-Reply-To: References: <20180518120826.GA19515@dragonet.kaist.ac.kr> <293d029c-b14c-a625-3703-97a5754e99f1@gmail.com> <20180518.114433.390752642781753429.davem@davemloft.net> From: Willem de Bruijn Date: Fri, 18 May 2018 14:44:22 -0400 Message-ID: Subject: Re: WARNING in ip_recv_error To: David Miller Cc: Eric Dumazet , DaeLyong Jeong , Alexey Kuznetsov , Hideaki YOSHIFUJI , Network Development , LKML , Byoungyoung Lee , Kyungtae Kim , bammanag@purdue.edu, Willem de Bruijn Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, May 18, 2018 at 1:09 PM, Willem de Bruijn wrote: > On Fri, May 18, 2018 at 11:44 AM, David Miller wrote: >> From: Eric Dumazet >> Date: Fri, 18 May 2018 08:30:43 -0700 >> >>> We probably need to revert Willem patch (7ce875e5ecb8562fd44040f69bda96c999e38bbc) >> >> Is it really valid to reach ip_recv_err with an ipv6 socket? > > I guess the issue is that setsockopt IPV6_ADDRFORM is not an > atomic operation, so that the socket is neither fully ipv4 nor fully > ipv6 by the time it reaches ip_recv_error. > > sk->sk_socket->ops = &inet_dgram_ops; > < HERE > > sk->sk_family = PF_INET; > > Even calling inet_recv_error to demux would not necessarily help. > > Safest would be to look up by skb->protocol, similar to what > ipv6_recv_error does to handle v4-mapped-v6. > > Or to make that function safe with PF_INET and swap the order > of the above two operations. > > All sound needlessly complicated for this rare socket option, but > I don't have a better idea yet. Dropping on the floor is not nice, > either. Ensuring that ip_recv_error correctly handles packets from either socket and removing the warning should indeed be good. It is robust against v4-mapped packets from an AF_INET6 socket, but see caveat on reconnect below. The code between ipv6_recv_error for v4-mapped addresses and ip_recv_error is essentially the same, the main difference being whether to return network headers as sockaddr_in with SOL_IP or sockaddr_in6 with SOL_IPV6. There are very few other locations in the stack that explicitly test sk_family in this way and thus would be vulnerable to races with IPV6_ADDRFORM. I'm not sure whether it is possible for a udpv6 socket to queue a real ipv6 packet on the error queue, disconnect, connect to an ipv4 address, call IPV6_ADDRFORM and then call ip_recv_error on a true ipv6 packet. That would return buggy data, e.g., in msg_name.