On Thu, Sep 30, 2021 at 11:30:35AM -0400, Vivek Goyal wrote: > So far we did not have the notion of cross queue traffic. That is, we > get request on a queue and send back response on same queue. So if a > request be being processed and at the same time a stop queue request > comes in, we wait for all pending requests to finish and then queue > is stopped and associated data structure cleaned. > > But with notification queue, now it is possible that we get a locking > request on request queue and send the notification back on a different > queue (notificaiton queue). This means, we need to make sure that s/notificaiton/notification/ > notifiation queue has not already been shutdown or is not being s/notifiation/notification/ > shutdown in parallel while we are trying to send a notification back. > Otherwise bad things are bound to happen. > > One way to solve this problem is that stop notification queue in the > end. First stop hiprio and all request queues. That means by the > time we are trying to stop notification queue, we know no other > request can be in progress which can try to send something on > notification queue. > > But problem is that currently we don't have any control on in what > order queues should be stopped. If there was a notion of whole device > being stopped, then we could decide in what order queues should be > stopped. > > Stefan mentioned that there is a command to stop whole device > VHOST_USER_SET_STATUS but it is not implemented in libvhost-user > yet. Also we probably could not move away from per queue stop > logic we have as of now. > > As an alternative, he said if we stop all queue when qidx 0 is > being stopped, it should be fine and we can solve the issue of > notification queue shutdown order. > > So in this patch I am shutting down all queues when queue 0 > is being shutdown. And also changed shutdown order in such a > way that notification queue is shutdown last. > > Suggested-by: Stefan Hajnoczi > Signed-off-by: Vivek Goyal > --- > tools/virtiofsd/fuse_virtio.c | 27 ++++++++++++++++++++++++++- > 1 file changed, 26 insertions(+), 1 deletion(-) > > diff --git a/tools/virtiofsd/fuse_virtio.c b/tools/virtiofsd/fuse_virtio.c > index c67c2e0e7a..a87e88e286 100644 > --- a/tools/virtiofsd/fuse_virtio.c > +++ b/tools/virtiofsd/fuse_virtio.c > @@ -826,6 +826,11 @@ static void fv_queue_cleanup_thread(struct fv_VuDev *vud, int qidx) > assert(qidx < vud->nqueues); > ourqi = vud->qi[qidx]; > > + /* Queue is already stopped */ > + if (!ourqi) { > + return; > + } > + > /* qidx == 1 is the notification queue if notifications are enabled */ > if (!se->notify_enabled || qidx != 1) { > /* Kill the thread */ > @@ -847,14 +852,25 @@ static void fv_queue_cleanup_thread(struct fv_VuDev *vud, int qidx) > > static void stop_all_queues(struct fv_VuDev *vud) > { > + struct fuse_session *se = vud->se; > + > for (int i = 0; i < vud->nqueues; i++) { > if (!vud->qi[i]) { > continue; > } > > + /* Shutdown notification queue in the end */ > + if (se->notify_enabled && i == 1) { > + continue; > + } > fuse_log(FUSE_LOG_INFO, "%s: Stopping queue %d thread\n", __func__, i); > fv_queue_cleanup_thread(vud, i); > } > + > + if (se->notify_enabled) { > + fuse_log(FUSE_LOG_INFO, "%s: Stopping queue %d thread\n", __func__, 1); > + fv_queue_cleanup_thread(vud, 1); > + } > } > > /* Callback from libvhost-user on start or stop of a queue */ > @@ -934,7 +950,16 @@ static void fv_queue_set_started(VuDev *dev, int qidx, bool started) > * the queue thread doesn't block in virtio_send_msg(). > */ > vu_dispatch_unlock(vud); > - fv_queue_cleanup_thread(vud, qidx); > + > + /* > + * If queue 0 is being shutdown, treat it as if device is being > + * shutdown and stop all queues. > + */ Please expand this comment so it's clear why we do this.