From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-13.5 required=3.0 tests=BAYES_00,DKIM_INVALID, DKIM_SIGNED,HEADER_FROM_DIFFERENT_DOMAINS,INCLUDES_CR_TRAILER,INCLUDES_PATCH, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id A5929C433B4 for ; Tue, 13 Apr 2021 13:50:58 +0000 (UTC) Received: from smtp3.osuosl.org (smtp3.osuosl.org [140.211.166.136]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPS id 4CCDD61242 for ; Tue, 13 Apr 2021 13:50:58 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 4CCDD61242 Authentication-Results: mail.kernel.org; dmarc=fail (p=none dis=none) header.from=redhat.com Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=virtualization-bounces@lists.linux-foundation.org Received: from localhost (localhost [127.0.0.1]) by smtp3.osuosl.org (Postfix) with ESMTP id 19484605A3; Tue, 13 Apr 2021 13:50:58 +0000 (UTC) X-Virus-Scanned: amavisd-new at osuosl.org Received: from smtp3.osuosl.org ([127.0.0.1]) by localhost (smtp3.osuosl.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id 035lp0Q0bAU2; Tue, 13 Apr 2021 13:50:57 +0000 (UTC) Received: from lists.linuxfoundation.org (lf-lists.osuosl.org [IPv6:2605:bc80:3010:104::8cd3:938]) by smtp3.osuosl.org (Postfix) with ESMTP id 8AEAD606A7; Tue, 13 Apr 2021 13:50:56 +0000 (UTC) Received: from lf-lists.osuosl.org (localhost [127.0.0.1]) by lists.linuxfoundation.org (Postfix) with ESMTP id 584B0C000B; Tue, 13 Apr 2021 13:50:56 +0000 (UTC) Received: from smtp1.osuosl.org (smtp1.osuosl.org [IPv6:2605:bc80:3010::138]) by lists.linuxfoundation.org (Postfix) with ESMTP id 0ED13C000A for ; Tue, 13 Apr 2021 13:50:55 +0000 (UTC) Received: from localhost (localhost [127.0.0.1]) by smtp1.osuosl.org (Postfix) with ESMTP id E427283CB7 for ; Tue, 13 Apr 2021 13:50:54 +0000 (UTC) X-Virus-Scanned: amavisd-new at osuosl.org Authentication-Results: smtp1.osuosl.org (amavisd-new); dkim=pass (1024-bit key) header.d=redhat.com Received: from smtp1.osuosl.org ([127.0.0.1]) by localhost (smtp1.osuosl.org [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id nGI-Q9_HUBpN for ; Tue, 13 Apr 2021 13:50:53 +0000 (UTC) X-Greylist: domain auto-whitelisted by SQLgrey-1.8.0 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [63.128.21.124]) by smtp1.osuosl.org (Postfix) with ESMTPS id B35BD83CC9 for ; Tue, 13 Apr 2021 13:50:53 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1618321852; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=3Lzjz6sbkoltOhmZF1GSTtIoPmdNeKpwGCjKD+C57bM=; b=F9hjsVbqOBnQzn46uzhz+RsLPtj6Chkp1okXLd6JGxrJ8zBeck+JiDPped1wIZq+/z7aw9 gwPlefFokUQnmdVS3JhDeokJ+tP3c6+t+wQpwBJ7sf8OhpkODfQXFyVv5boof9i0MoNwJT A+PK+EBJWX9Vxte8DUHB3x53rVTF67U= Received: from mail-wm1-f70.google.com (mail-wm1-f70.google.com [209.85.128.70]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-590-Lu_HcOyINeeNDCwD2Z32ow-1; Tue, 13 Apr 2021 09:50:50 -0400 X-MC-Unique: Lu_HcOyINeeNDCwD2Z32ow-1 Received: by mail-wm1-f70.google.com with SMTP id u11-20020a05600c00cbb029012a3f52677dso1151650wmm.8 for ; Tue, 13 Apr 2021 06:50:50 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to; bh=3Lzjz6sbkoltOhmZF1GSTtIoPmdNeKpwGCjKD+C57bM=; b=iU89Cmyr/ICJ0rl3gpmy1974szADd4sWLhKOZdVRkJkhzV1h7gQR+ldAKP5PFwVBfm iMwlz+4+pA/DLoU7gKPC4RwLa7CVvhKJLGMUcRYAkdFyQgZGHKQo2qYJp3SVsOGPEUud z1Ds+6wgmgLGOCXO4qBCmE1HhpAQ/X9/5W//oxV1OFWnaiu2BFNDVsXoMGx5HCrpgX4d zhiq7kk+gWwLCN7QUXSZFTHVqcAt7nYJARCkyh56QNVo4WggV6mSGMWJcSCBP6JjIyS8 ClwcygIeHNczdN4HviaPJrEv63SOGpQJrSUTPuZ/ITs5KUxH/D/05ERF3Agmlo/LxdKd ng6Q== X-Gm-Message-State: AOAM530wm6OivZeLDnwyM3qc83+cCmCRORJlzi+ZTM8r/t+gk6CaLzV5 +ReP3UukTaDTIzOeVrokpw5tfiJOCS6emF5RkqN+Mpa0LzR+ZQcXBZdNkPL0y299vYViCW5MuqK UwV6GuNPruLMbaIQ6AexznBlnICJcAM13pNtMFgG7qw== X-Received: by 2002:adf:f70f:: with SMTP id r15mr9127969wrp.208.1618321849525; Tue, 13 Apr 2021 06:50:49 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyqfbrMtaH/cXC4UtAOMUFq8i2GnmV6rLBlhAgTsal3iMBigo64oNp7l7peM+bcAq2YqQKMbQ== X-Received: by 2002:adf:f70f:: with SMTP id r15mr9127953wrp.208.1618321849337; Tue, 13 Apr 2021 06:50:49 -0700 (PDT) Received: from redhat.com ([2a10:8006:2281:0:1994:c627:9eac:1825]) by smtp.gmail.com with ESMTPSA id b12sm8648123wrn.18.2021.04.13.06.50.47 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 13 Apr 2021 06:50:48 -0700 (PDT) Date: Tue, 13 Apr 2021 09:50:45 -0400 From: "Michael S. Tsirkin" To: Stefano Garzarella Subject: Re: [RFC v2] virtio-vsock: add description for datagram type Message-ID: <20210413094722-mutt-send-email-mst@kernel.org> References: <20210401043602.3510538-1-jiang.wang@bytedance.com> <20210412142133.t44pn5pjy6fdcvk4@steredhat> <20210413125853.2dkldmp23vkkc74c@steredhat> <20210413091251-mutt-send-email-mst@kernel.org> <20210413133852.ebkrlbyetiqu4uje@steredhat> MIME-Version: 1.0 In-Reply-To: <20210413133852.ebkrlbyetiqu4uje@steredhat> Authentication-Results: relay.mimecast.com; auth=pass smtp.auth=CUSA124A263 smtp.mailfrom=mst@redhat.com X-Mimecast-Spam-Score: 0 X-Mimecast-Originator: redhat.com Content-Disposition: inline Cc: cong.wang@bytedance.com, Xiongchun Duan , "Jiang Wang ." , cohuck@redhat.com, virtualization@lists.linux-foundation.org, xieyongji@bytedance.com, Stefan Hajnoczi , Arseny Krasnov , asias@redhat.com X-BeenThere: virtualization@lists.linux-foundation.org X-Mailman-Version: 2.1.15 Precedence: list List-Id: Linux virtualization List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit Errors-To: virtualization-bounces@lists.linux-foundation.org Sender: "Virtualization" On Tue, Apr 13, 2021 at 03:38:52PM +0200, Stefano Garzarella wrote: > On Tue, Apr 13, 2021 at 09:16:50AM -0400, Michael S. Tsirkin wrote: > > On Tue, Apr 13, 2021 at 02:58:53PM +0200, Stefano Garzarella wrote: > > > On Mon, Apr 12, 2021 at 03:42:23PM -0700, Jiang Wang . wrote: > > > > On Mon, Apr 12, 2021 at 7:21 AM Stefano Garzarella wrote: > > > > > > > > > > On Mon, Apr 12, 2021 at 02:50:17PM +0100, Stefan Hajnoczi wrote: > > > > > >On Thu, Apr 01, 2021 at 04:36:02AM +0000, jiang.wang wrote: > > > > > >> Add supports for datagram type for virtio-vsock. Datagram > > > > > >> sockets are connectionless and unreliable. To avoid contention > > > > > >> with stream and other sockets, add two more virtqueues and > > > > > >> a new feature bit to identify if those two new queues exist or not. > > > > > >> > > > > > >> Also add descriptions for resource management of datagram, which > > > > > >> does not use the existing credit update mechanism associated with > > > > > >> stream sockets. > > > > > >> > > > > > >> Signed-off-by: Jiang Wang > > > > > >> --- > > > > > >> V2 addressed the comments for the previous version. > > > > > >> > > > > > >> virtio-vsock.tex | 62 +++++++++++++++++++++++++++++++++++++++++++++++--------- > > > > > >> 1 file changed, 52 insertions(+), 10 deletions(-) > > > > > >> > > > > > >> diff --git a/virtio-vsock.tex b/virtio-vsock.tex > > > > > >> index da7e641..62c12e0 100644 > > > > > >> --- a/virtio-vsock.tex > > > > > >> +++ b/virtio-vsock.tex > > > > > >> @@ -11,12 +11,25 @@ \subsection{Virtqueues}\label{sec:Device Types / Socket Device / Virtqueues} > > > > > >> \begin{description} > > > > > >> \item[0] rx > > > > > >> \item[1] tx > > > > > >> +\item[2] datagram rx > > > > > >> +\item[3] datagram tx > > > > > >> +\item[4] event > > > > > >> +\end{description} > > > > > >> +The virtio socket device uses 5 queues if feature bit VIRTIO_VSOCK_F_DRGAM is set. Otherwise, it > > > > > >> +only uses 3 queues, as the following. Rx and tx queues are always used for stream sockets. > > > > > >> + > > > > > >> +\begin{description} > > > > > >> +\item[0] rx > > > > > >> +\item[1] tx > > > > > >> \item[2] event > > > > > >> \end{description} > > > > > >> > > > > > > > > > > > >I suggest renaming "rx" and "tx" to "stream rx" and "stream tx" > > > > > >virtqueues and also adding this: > > > > > > > > > > > > When behavior differs between stream and datagram rx/tx virtqueues > > > > > > their full names are used. Common behavior is simply described in > > > > > > terms of rx/tx virtqueues and applies to both stream and datagram > > > > > > virtqueues. > > > > > > > > > > > >This way you won't need to duplicate portions of the spec that deal with > > > > > >populating the virtqueues, for example. > > > > > > > > > > > >It's also clearer to use a full name for stream rx/tx virtqueues instead > > > > > >of calling them rx/tx virtqueues now that we have datagram rx/tx > > > > > >virtqueues. > > > > > > > > > > > >> + > > > > > >> \subsection{Feature bits}\label{sec:Device Types / Socket Device / Feature bits} > > > > > >> > > > > > >> -There are currently no feature bits defined for this device. > > > > > >> +\begin{description} > > > > > >> +\item[VIRTIO_VSOCK_F_DGRAM (0)] Device has support for datagram socket type. > > > > > >> +\end{description} > > > > > >> > > > > > >> \subsection{Device configuration layout}\label{sec:Device Types / Socket Device / Device configuration layout} > > > > > >> > > > > > >> @@ -107,6 +120,9 @@ \subsection{Device Operation}\label{sec:Device Types / Socket Device / Device Op > > > > > >> > > > > > >> \subsubsection{Virtqueue Flow Control}\label{sec:Device Types / Socket Device / Device Operation / Virtqueue Flow Control} > > > > > >> > > > > > >> +Flow control applies to stream sockets; datagram sockets do not have > > > > > >> +flow control. > > > > > >> + > > > > > >> The tx virtqueue carries packets initiated by applications and replies to > > > > > >> received packets. The rx virtqueue carries packets initiated by the device and > > > > > >> replies to previously transmitted packets. > > > > > >> @@ -140,12 +156,15 @@ \subsubsection{Addressing}\label{sec:Device Types / Socket Device / Device Opera > > > > > >> consists of a (cid, port number) tuple. The header fields used for this are > > > > > >> \field{src_cid}, \field{src_port}, \field{dst_cid}, and \field{dst_port}. > > > > > >> > > > > > >> -Currently only stream sockets are supported. \field{type} is 1 for stream > > > > > >> -socket types. > > > > > >> +Currently stream and datagram (dgram) sockets are supported. \field{type} is 1 for stream > > > > > >> +socket types. \field{type} is 3 for dgram socket types. > > > > > >> > > > > > >> Stream sockets provide in-order, guaranteed, connection-oriented delivery > > > > > >> without message boundaries. > > > > > >> > > > > > >> +Datagram sockets provide connectionless unreliable messages of > > > > > >> +a fixed maximum length. > > > > > > > > > > > >Plus unordered (?) and with message boundaries. In other words: > > > > > > > > > > > > Datagram sockets provide unordered, unreliable, connectionless message > > > > > > with message boundaries and a fixed maximum length. > > > > > > > > > > > >I didn't think of the fixed maximum length aspect before. I guess the > > > > > >intention is that the rx buffer size is the message size limit? That's > > > > > >different from UDP messages, which can be fragmented into multiple IP > > > > > >packets and can be larger than 64KiB: > > > > > >https://en.wikipedia.org/wiki/User_Datagram_Protocol#UDP_datagram_structure > > > > > > > > > > > >Is it possible to support large datagram messages in vsock? I'm a little > > > > > >concerned that applications that run successfully over UDP will not be > > > > > >portable if vsock has this limitation because it would impose extra > > > > > >message boundaries that the application protocol might not tolerate. > > > > > > > > > > Maybe we can reuse the same approach Arseny is using for SEQPACKET. > > > > > Fragment the packets according to the buffers in the virtqueue and set > > > > > the EOR flag to indicate the last packet in the message. > > > > > > > > > Agree. Another option is to use the ones for skb since we may need to > > > > use skbs for multiple transport support anyway. > > > > > > > > > > The important thing I think is to have a single flag in virtio-vsock that > > > identifies pretty much the same thing: this is the last fragment of a series > > > to rebuild a packet. > > > > > > We should reuse the same flag for DGRAM and SEQPACKET. > > > > > > Thanks, > > > Stefano > > > > Well DGRAM can drop data so I wonder whether it can work ... > > > > Yep, this is true, but the channel should not be losing packets, so if the > receiver discards packets, it knows that it must then discard all of them > until the EOR. That is not so easy - they can come mixed up from multiple sources. Sure linux net core does this but with fragmentation added in, I start wondering whether you are beginning to reinvent the net stack ... > If the transmitter has to discard a packet, actually that could be a > problem... > We could provide another flag or a special packet to say that the rest of > the packet won't arrive and so you have to discard it all. > > Otherwise we need to go back to boundary packets as in the first > implementation of SEQPACKET. > > Stefano -- MST _______________________________________________ Virtualization mailing list Virtualization@lists.linux-foundation.org https://lists.linuxfoundation.org/mailman/listinfo/virtualization