From mboxrd@z Thu Jan  1 00:00:00 1970
Received: from eggs.gnu.org ([2001:4830:134:3::10]:34814)
	by lists.gnu.org with esmtp (Exim 4.71)
	(envelope-from <armbru@redhat.com>) id 1bwlnd-0000FP-7P
	for qemu-devel@nongnu.org; Wed, 19 Oct 2016 04:00:34 -0400
Received: from Debian-exim by eggs.gnu.org with spam-scanned (Exim 4.71)
	(envelope-from <armbru@redhat.com>) id 1bwlnY-0002hb-8R
	for qemu-devel@nongnu.org; Wed, 19 Oct 2016 04:00:33 -0400
Received: from mx1.redhat.com ([209.132.183.28]:42660)
	by eggs.gnu.org with esmtps (TLS1.0:DHE_RSA_AES_256_CBC_SHA1:32)
	(Exim 4.71) (envelope-from <armbru@redhat.com>) id 1bwlnY-0002gm-0s
	for qemu-devel@nongnu.org; Wed, 19 Oct 2016 04:00:28 -0400
Received: from int-mx11.intmail.prod.int.phx2.redhat.com
	(int-mx11.intmail.prod.int.phx2.redhat.com [10.5.11.24])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(No client certificate requested)
	by mx1.redhat.com (Postfix) with ESMTPS id 15855624AA
	for <qemu-devel@nongnu.org>; Wed, 19 Oct 2016 08:00:27 +0000 (UTC)
From: Markus Armbruster <armbru@redhat.com>
References: <20161012191502.GC16187@work-vm>
	<20161018100409.GH4349@redhat.com> <20161018113202.GE2190@work-vm>
	<20161018120121.GN4349@redhat.com> <20161018132524.GG2190@work-vm>
	<20161018133528.GD12728@redhat.com> <20161018135213.GI2190@work-vm>
	<20161018140141.GF12728@redhat.com>
Date: Wed, 19 Oct 2016 10:00:24 +0200
In-Reply-To: <20161018140141.GF12728@redhat.com> (Daniel P. Berrange's message
	of "Tue, 18 Oct 2016 15:01:41 +0100")
Message-ID: <87wph4g44n.fsf@dusky.pond.sub.org>
MIME-Version: 1.0
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: quoted-printable
Subject: Re: [Qemu-devel] chardev's and fd's in monitors
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <http://lists.nongnu.org/archive/html/qemu-devel/>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
	<mailto:qemu-devel-request@nongnu.org?subject=subscribe>
To: "Daniel P. Berrange" <berrange@redhat.com>
Cc: "Dr. David Alan Gilbert" <dgilbert@redhat.com>, qemu-devel@nongnu.org

"Daniel P. Berrange" <berrange@redhat.com> writes:

> On Tue, Oct 18, 2016 at 02:52:13PM +0100, Dr. David Alan Gilbert wrote:
>> * Daniel P. Berrange (berrange@redhat.com) wrote:
>> > On Tue, Oct 18, 2016 at 02:25:25PM +0100, Dr. David Alan Gilbert wrote:
>> > > * Daniel P. Berrange (berrange@redhat.com) wrote:
>> > > > On Tue, Oct 18, 2016 at 12:32:02PM +0100, Dr. David Alan Gilbert w=
rote:
>> > > > > * Daniel P. Berrange (berrange@redhat.com) wrote:
>> > > > > > On Wed, Oct 12, 2016 at 08:15:02PM +0100, Dr. David Alan Gilbe=
rt wrote:
>> > > > > > > Hi,
>> > > > > > >   I had a look at a couple of readline like libraries;
>> > > > > > > editline and linenoise.  A difficulty with using them is that
>> > > > > > > they both want fd's or FILE*'s; editline takes either but
>> > > > > > > from a brief look I think it's expecting to extract the fd.
>> > > > > > > That makes them tricky to integrate into qemu, where
>> > > > > > > the chardev's hide a whole bunch of non-fd things; in partic=
ular
>> > > > > > > tls, mux, ringbuffers etc.
>> > > > > > >=20
>> > > > > > > If we could get away with just a FILE* then we could use fop=
encookie,
>> > > > > > > but that's GNU only.
>> > > > > > >=20
>> > > > > > > Is there any sane way of shepherding all chardev's into havi=
ng an
>> > > > > > > fd?
>> > > > > >=20
>> > > > > > The entire chardev abstraction model exists precisely because =
we cannot
>> > > > > > make all chardevs look like a single fd. Even those which are =
fd based
>> > > > > > may have separate FDs for input and output.
>> > > > >=20
>> > > > > Note that editline takes separate in/out streams, but it does wa=
nt those streams
>> > > > > to be FILE*'s.
>> > > > >=20
>> > > > > > IMHO the only viable approach would be to enhance linenoise/ed=
itline to
>> > > > > > not assume use of fd* or FILE * abstractions.
>> > > > >=20
>> > > > > I think if it came to that then we'd probably end up sticking wi=
th what we
>> > > > > had for a very long time; I'd assume it would take a long time b=
efore
>> > > > > any mods we made to the libraries would come around to be genera=
lly useful.
>> > > > >=20
>> > > > > > BTW, what is the actual thread issue you are facing ? Chardevs=
 at least
>> > > > > > ought to be usable from a separate thread, as long as each dis=
tinct
>> > > > > > chardev object instance was only used from one thread at a tim=
e ?
>> > > > >=20
>> > > > > Marc-Andr=C3=A9 pointed that out; I hadn't realised they were th=
read safe.
>> > > > > But what are the rules? You say 'only used from one thread at a =
time' -
>> > > > > what happens if we have a mux and the different streams to the m=
ux come
>> > > > > from different threads?
>> > > >=20
>> > > > Well there is no mutex locking on the CharDriverState objects, so =
the
>> > > > exact rule is "you mustn't do anything from multiple threads that =
will
>> > > > race on contents of CharDriverState". That's too fuzzy to be usefu=
l to
>> > > > developers though, so I think the only sensible option right now i=
s to
>> > > > say any "top level" CharDriverState should only be touch from one =
thread
>> > > > at a time. IOW, if you have a mux, that that rule would apply to t=
he
>> > > > mux itself and the various children it owns as if they were a sing=
le
>> > > > unnit.
>> > >=20
>> > > OK; I think we're probably saved by the big lock at the moment, so t=
hat
>> > > all device emulation that outputs text is probably holding it and th=
e monitor
>> > > is also.  What about something like an error_report from a different=
 thread
>> > > while something is happening in the monitor?
>> >=20
>> > If we moved execution of monitor commands to separate thread from the
>> > thread handling monitor I/O, then we'd have to modify error_report so
>> > that it queued the text in some manner, such that it was only then
>> > fed back to the client once the command thread completed. Alternatively
>> > we'd have to introduced locking in the Monitor object, that serialized
>> > access to the underling CharDriverState I/O funcs.
>>=20
>> I already use error_report's in places in migration threads of various
>> types; I'm not sure if that's a problem.
>
> Unless those places are protected by the big qemu lock, that sounds
> not good. error_report calls into error_vprintf which checks the
> 'cur_mon' global "Monitor" pointer. This variable is updated at
> runtime - eg in qmp_human_monitor_command(), monitor_qmp_read(),
> monitor_read(), etc. So if migration threads outside the BQL are
> calling error_report() that could well cause problems. If you
> are lucky messages will merely end up going to stderr instead of
> the monitor, but in worst case I wouldn't be surprised if there
> is a crash possibility in some race conditions.

cur_mon dates back to single-threaded times.

The idea is to print to the monitor when running within an HMP command,
else to stderr.

The current solution is to set cur_mon around monitor commands.  Fine
with a single thread, not fine at all with multiple threads.

Making cur_mon thread-local should fix things.

If you do want to report errors from another thread in a monitor, you
should use error_setg() & friends to get them into the monitor, in my
opinion.  Asynchronously barfing output to a monitor doesn't strike me
as a sensible design.  Not least because it doesn't work at all with
QMP!  If an error message is important enough for the human monitor's
user to make use route it to the human monitor, why is hiding it from
the QMP client okay?

If I'm wrong and it is sensible, we need locking.