From: Rao Shoaib <rao.shoaib at oracle.com>
To: mptcp at lists.01.org
Subject: Re: [MPTCP] Separating MPTCP packet processing from TCP
Date: Tue, 16 May 2017 09:39:09 -0700 [thread overview]
Message-ID: <d4946067-efed-a063-639f-b699b86753fc@oracle.com> (raw)
In-Reply-To: 20170516045931.GZ6301@da0602a-dhcp133.apple.com
[-- Attachment #1: Type: text/plain, Size: 4677 bytes --]
On 05/15/2017 09:59 PM, Christoph Paasch wrote:
> Hello Rao,
>
> On 15/05/17 - 18:16:36, Rao Shoaib wrote:
>> I am thinking the cleanest implementation would be to separate MPTCP
>> processing from TCP. On the receive side, when a packet arrives TCP only
>> does TCP processing (That leaves the code untouched), the packet is than
>> passed upto MPTCP which does MPTCP processing and can either process the
>> packet, drop it or send a reset.
>>
>> The current implementation is doing receive processing in TCP because it
>> wants to validate the packet and accept it in TCP or not -- but why ? that
>> seems to be an implementation choice.
> you can look at mptcp_handle_options for the conditions upon which we stop
> processing the segment, when coming from tcp_validate_incoming. For example,
> when a MP_FASTCLOSE is received, the subflow gets killed with a TCP-RST.
>
> The same holds when receiving a DSS-option without DSS-checksum although the
> DSS-checksum has been negotiated.
>
>
> The question would be, whether we can delay the killing of the subflow &
> sending of the RST until later. This would mean that the packet gets
> completely handled (acks processed, incoming data acknowledged,...)
> by the TCP-stack and later on the subflow gets killed.
> This *might* be ok from a protocol-perspective, but I'm not sure.
Hi Christoph,
That is exactly what I am suggesting. I know this is fine from the
protocol perspective and I don't see why it can not be implemented. That
will leave the TCP code untouched on the receive side and make the
upstream guys happy because that is the fast path. Last night I was
looking at the control path, that needs to be cleaned up also but that
is the slow path.
>
>> In the case where the receiver drops the packet in MPTCP, no data ack will
>> be sent and MPTCP will re-transmit, It can retransmit even on the same flow.
>> To achieve this the code requires some change as the DSS option has to be
>> saved. I think this is doable and is a much cleaner solution.
> For incoming data, that's already how it is handled. We pass the segment to
> the MPTCP-stack through the sk_data_ready callback, where we go over the
> segments, check whether their DSS-checksum is correct, check if it is
> in-window,... And if all is good, queue it at the MPTCP-level and send a
> DATA_ACK.
Yes. However the MPTCP options are parsed and mptcp_flags and dss offset
are extracted, no need to do that. Let MPTCP handle it. It's not a lot
of overhead and leaves the TCP code clean. This also removes the
requirement of trying to find space in tcp_skb_cb to pass any
information to MPTCP.
Regards,
Rao.
>
>
> Cheers,
> Christoph
>
>> Similarly we need to think about the Tx side -- That is not so straight
>> forward but we need to think harder.
>>
>> I can work on making the Rx changes but want to discuss it in case I am
>> missing potential issues and if this is not a got option.
>>
>> Comments ?
>>
>> Rao
>>
>> From: Rao Shoaib <rao.shoaib(a)oracle.com>
>> To: "multipathtcp(a)ietf.org" <multipathtcp(a)ietf.org>
>> Date: Mon, 15 May 2017 17:39:12 -0700
>> Subject: [multipathtcp] Question on Data Sequence Mapping
>>
>> Hi,
>>
>> Section 3.3.1 of RFC 6824 says
>>
>> The data sequence mapping specifies a mapping from subflow sequence space to
>> data sequence space. This is expressed in terms of starting sequence numbers
>> for the subflow and the data level, and a length of bytes for which this
>> mapping is valid.
>>
>> <...>
>>
>> A mapping is fixed, in that the subflow sequence number is bound to the data
>> sequence number after the mapping has been processed. A sender MUST NOT
>> change this mapping after it has been declared;
>>
>> Does it mean that I can not map the data sequence to another (higher) TCP
>> sequence number on the same flow.
>>
>> The reason I am asking this is that if this is allowed TCP and MPTCP
>> processing can be separated on the recv side. For example, TCP could accept
>> the packet but MPTCP could reject it. Since data level ack would not be sent
>> the peer would retransmit, possibly using the same flow but with a different
>> mapping so that TCP would accept the packet.
>>
>> Is this legal to do and if not than why ?
>>
>> Regards,
>>
>> Rao
>>
>> _______________________________________________
>> multipathtcp mailing list
>> multipathtcp(a)ietf.org
>> https://www.ietf.org/mailman/listinfo/multipathtcp
>> _______________________________________________
>> mptcp mailing list
>> mptcp(a)lists.01.org
>> https://lists.01.org/mailman/listinfo/mptcp
next reply other threads:[~2017-05-16 16:39 UTC|newest]
Thread overview: 6+ messages / expand[flat|nested] mbox.gz Atom feed top
2017-05-16 16:39 Rao Shoaib [this message]
-- strict thread matches above, loose matches on Subject: below --
2017-05-16 22:34 [MPTCP] Separating MPTCP packet processing from TCP Rao Shoaib
2017-05-16 22:22 Rao Shoaib
2017-05-16 21:57 Mat Martineau
2017-05-16 4:59 Christoph Paasch
2017-05-16 1:16 Rao Shoaib
Reply instructions:
You may reply publicly to this message via plain-text email
using any one of the following methods:
* Save the following mbox file, import it into your mail client,
and reply-to-all from there: mbox
Avoid top-posting and favor interleaved quoting:
https://en.wikipedia.org/wiki/Posting_style#Interleaved_style
* Reply using the --to, --cc, and --in-reply-to
switches of git-send-email(1):
git send-email \
--in-reply-to=d4946067-efed-a063-639f-b699b86753fc@oracle.com \
--to=unknown@example.com \
/path/to/YOUR_REPLY
https://kernel.org/pub/software/scm/git/docs/git-send-email.html
* If your mail client supports setting the In-Reply-To header
via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line
before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.