From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <SRS0=2LwM=FZ=nongnu.org=qemu-devel-bounces+qemu-devel=archiver.kernel.org@kernel.org>
X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on
	aws-us-west-2-korg-lkml-1.web.codeaurora.org
X-Spam-Level: 
X-Spam-Status: No, score=-2.0 required=3.0 tests=BAYES_00,BODY_EMPTY,
	HK_RANDOM_FROM,MAILING_LIST_MULTI,SPF_HELO_NONE autolearn=no
	autolearn_force=no version=3.4.0
Received: from mail.kernel.org (mail.kernel.org [198.145.29.99])
	by smtp.lore.kernel.org (Postfix) with ESMTP id 642A5C433E0
	for <qemu-devel@archiver.kernel.org>; Mon, 21 Dec 2020 18:49:36 +0000 (UTC)
Received: from lists.gnu.org (lists.gnu.org [209.51.188.17])
	(using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits))
	(No client certificate requested)
	by mail.kernel.org (Postfix) with ESMTPS id 9BB0622AAB
	for <qemu-devel@archiver.kernel.org>; Mon, 21 Dec 2020 18:49:35 +0000 (UTC)
DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 9BB0622AAB
Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=bu.edu
Authentication-Results: mail.kernel.org; spf=pass smtp.mailfrom=qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Received: from localhost ([::1]:36558 helo=lists1p.gnu.org)
	by lists.gnu.org with esmtp (Exim 4.90_1)
	(envelope-from <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org>)
	id 1krQFS-0005bw-5s
	for qemu-devel@archiver.kernel.org; Mon, 21 Dec 2020 13:49:34 -0500
Received: from eggs.gnu.org ([2001:470:142:3::10]:45710)
 by lists.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <alxndr@bu.edu>) id 1krQDm-0004xI-MZ
 for qemu-devel@nongnu.org; Mon, 21 Dec 2020 13:47:50 -0500
Received: from relay64.bu.edu ([128.197.228.104]:50881)
 by eggs.gnu.org with esmtps (TLS1.2:ECDHE_RSA_AES_256_GCM_SHA384:256)
 (Exim 4.90_1) (envelope-from <alxndr@bu.edu>) id 1krQDi-0000BX-Ut
 for qemu-devel@nongnu.org; Mon, 21 Dec 2020 13:47:49 -0500
X-Envelope-From: alxndr@bu.edu
X-BU-AUTH: pool-72-93-72-163.bstnma.fios.verizon.net [72.93.72.163]
Received: from BU-AUTH (localhost.localdomain [127.0.0.1]) (authenticated
 bits=0)
 by relay64.bu.edu (8.14.3/8.14.3) with ESMTP id 0BLIl0dq028397
 (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256 verify=NO);
 Mon, 21 Dec 2020 13:47:15 -0500
From: Alexander Bulekov <alxndr@bu.edu>
To: Qiuhao Li <Qiuhao.Li@outlook.com>, qemu-devel@nongnu.org
Subject: Re: [PATCH 1/4] fuzz: refine crash detection mechanism
In-Reply-To: <ME3P282MB14924A6558A105B7FBFA579DFCC20@ME3P282MB1492.AUSP282.PROD.OUTLOOK.COM>
MIME-Version: 1.0
Content-Type: text/plain
signatures: 
https: //github.com/google/clusterfuzz/blob/master/src/python/crash_analysis/crash_analyzer.py
 Qiuhao Li <Qiuhao.Li@outlook.com> writes:
 > The original crash detection method is to fork a process to test our new
 > trace input. If the child process exits in time and the second-to-last line
 > is the same as the first crash, we think it is a crash triggered by the same
 > bug. However, in some situations, it doesn't work since it is a
 > hardcoded-offset string comparison.
 >
 > For example, suppose an assertion failure makes the crash. In that case, the
 > second-to-last line will be 'timeout: the monitored command dumped core',
 > which doesn't contain any information about the assertion failure like where
 > it happened or the assertion statement. This may lead to a minimized input
 > triggers assertion failure but may indicate another bug. As for some
 > sanitizers' crashes, the direct string comparison may stop us from getting a
 > smaller input, since they may have a different leaf stack frame.
 >
 > Perhaps we can detect crashes using both precise output string comparison
 > and rough pattern string match and info the user when the trace input
 > triggers different but a seminar output.
                            ^^ similar
 >
 > Tested:
 > Assertion failure, https://bugs.launchpad.net/qemu/+bug/1908062
 > AddressSanitizer, https://bugs.launchpad.net/qemu/+bug/1907497
 > Trace input that doesn't crash
 > Trace input that crashes Qtest

 I'm not sure about this one. Is there an example where setting
 CRASH_TOKEN is not sufficient?

 The current approach isn't great. It relies on a few bad assumptions
 and has some limitations:
 1. lines[-2] is often "good enough" to find a crash.
 2. If lines[-2] doesn't do the trick, it should be simple to identify a
    "CRASH_TOKEN" (e.g. a path:line-number in the stack trace).
 3. Limitation: no good way to minimize timeouts. This is a tricky one,
    since a well-behaved QEMU will continue running after going through
    all the qtest commands, and this can be tough to distinguish from a
    QEMU stuck in an infinite loop or stuck in a syscall.

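For reference, assumptions 1 and 2 can be sketched roughly as below. This is a simplified restatement of the CRASH_TOKEN logic that the patch removes, not the script itself: make_crash_checker and the sample output strings are made up for illustration.

```python
def make_crash_checker(crash_token=None):
    """Return a checker mimicking the pre-patch CRASH_TOKEN logic.

    crash_token plays the role of the CRASH_TOKEN environment variable;
    the closure replaces the script's global (illustrative choice).
    """
    state = {"token": crash_token}

    def crashed(output):
        lines = output.splitlines()
        if len(lines) < 2:
            return False
        if state["token"] is None:
            # Assumption 1: the second-to-last line identifies the crash.
            state["token"] = lines[-2]
        # Plain substring match over the whole output.
        return state["token"] in output

    return crashed


# Invented output, only shaped like a QEMU assertion failure.
first_run = "boot\nhw/foo.c:12: assertion failed\nAborted\n"
check = make_crash_checker()
print(check(first_run))              # the first run fixes the token
print(check("clean run\nexit\n"))    # token absent from this output
```

Passing a token explicitly, e.g. make_crash_checker("hw/foo.c:12"), corresponds to assumption 2: the user picks a stable substring when lines[-2] is not informative.
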
 I think my main concerns are:
 * crash_patterns might not catch everything. For example, this one
   doesn't match either pattern: https://bugs.launchpad.net/bugs/1890160
 * SUMMARY.+Sanitizer lines often contain volatile addresses, so the
   exact string comparison will often fail and the matching will fall
   back to the SUMMARY.+Sanitizer pattern. Sometimes this means that the
   minimized result will be another crash (I have seen this happen).
 * Maybe it is unlikely, but what will happen if ASan/UBSan etc. decide
   to change the format of their output?

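To make the second concern concrete, here is a small sketch of the two-stage comparison (exact string first, then the regex). The SUMMARY lines are invented and only shaped like sanitizer output, and classify is a hypothetical helper, not something in the patch.

```python
import re

CRASH_PATTERN = "SUMMARY.+Sanitizer"  # taken from the patch's crash_patterns

# Invented sample lines; real sanitizer summaries may look different.
original = "SUMMARY: AddressSanitizer: heap-use-after-free (/tmp/qemu+0x1a2b3c) in foo"
same_bug = "SUMMARY: AddressSanitizer: heap-use-after-free (/tmp/qemu+0x9f8e7d) in foo"
other_bug = "SUMMARY: AddressSanitizer: stack-overflow (/tmp/qemu+0x445566) in bar"


def classify(crash_string, line):
    """Report which stage of the two-stage comparison accepts the line."""
    if line == crash_string:
        return "exact"
    if re.search(CRASH_PATTERN, line) is not None:
        return "pattern-only"
    return "no-match"


for candidate in (original, same_bug, other_bug):
    print(classify(original, candidate))
```

Once the address changes, the same bug and an unrelated bug are both accepted only through the pattern, so the minimizer cannot tell them apart.
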
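 We can look at the way ClusterFuzz (and OSS-Fuzz) identifies crash
 signatures:
 https://github.com/google/clusterfuzz/blob/master/src/python/crash_analysis/crash_analyzer.py

 It seems a lot more involved, and I'm not sure if it is necessary,
 since at this point, the minimizer is only used manually.
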
 Are there any cases where the current approach, with an occasional
 fallback to CRASH_TOKEN, is not sufficient?

 I like the idea of making CRASH_TOKEN/crash_pattern a regex, though I
 would simply do a global match over the output, instead of applying it
 to each line.

 Thanks
 -Alex

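P.S. A minimal sketch of what a global match could look like, assuming the whole QEMU output is already available as one decoded string. output_matches is a made-up name and the sample output is invented.

```python
import re


def output_matches(crash_pattern, output):
    """Return True if crash_pattern matches anywhere in the full output."""
    # One search over the whole string instead of a loop over lines.
    # "." does not match a newline by default, so a pattern such as
    # "Assertion.+failed" still has to match within a single line.
    return re.search(crash_pattern, output) is not None


# Invented output, only shaped like an assertion failure.
sample = "qtest log\nqemu: hw/foo.c:12: bar: Assertion `x' failed.\nAborted\n"
print(output_matches("Assertion.+failed", sample))
print(output_matches("SUMMARY.+Sanitizer", sample))
```

A literal token such as a path:line-number would keep working as long as it is passed through re.escape() first, and a pattern that needs to span two lines can include an explicit "\n".
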
 >
 > Signed-off-by: Qiuhao Li <Qiuhao.Li@outlook.com>
 > ---
 >  scripts/oss-fuzz/minimize_qtest_trace.py | 59 ++++++++++++++++++------
 >  1 file changed, 46 insertions(+), 13 deletions(-)
 >
 > diff --git a/scripts/oss-fuzz/minimize_qtest_trace.py b/scripts/oss-fuzz/minimize_qtest_trace.py
 > index 5e405a0d5f..d3b09e6567 100755
 > --- a/scripts/oss-fuzz/minimize_qtest_trace.py
 > +++ b/scripts/oss-fuzz/minimize_qtest_trace.py
 > @@ -10,11 +10,16 @@ import os
 >  import subprocess
 >  import time
 >  import struct
 > +import re
 >
 >  QEMU_ARGS = None
 >  QEMU_PATH = None
 >  TIMEOUT = 5
 > -CRASH_TOKEN = None
 > +
 > +crash_patterns = ("Assertion.+failed",
 > +                  "SUMMARY.+Sanitizer")
 > +crash_pattern = None
 > +crash_string = None
 >
 >  write_suffix_lookup = {"b": (1, "B"),
 >                         "w": (2, "H"),
 > @@ -24,13 +29,12 @@ write_suffix_lookup = {"b": (1, "B"),
 >  def usage():
 >      sys.exit("""\
 >  Usage: QEMU_PATH="/path/to/qemu" QEMU_ARGS="args" {} input_trace output_trace
 > -By default, will try to use the second-to-last line in the output to identify
 > -whether the crash occred. Optionally, manually set a string that idenitifes the
 > -crash by setting CRASH_TOKEN=
 > +By default, we will try to search predefined crash patterns through the
 > +tracing output to see whether the crash occred. Optionally, manually set a
 > +string that idenitifes the crash by setting CRASH_PATTERN=
 >  """.format((sys.argv[0])))
 >
 >  def check_if_trace_crashes(trace, path):
 > -    global CRASH_TOKEN
 >      with open(path, "w") as tracefile:
 >          tracefile.write("".join(trace))
 >
 > @@ -42,17 +46,47 @@ def check_if_trace_crashes(trace, path):
 >                            shell=True,
 >                            stdin=subprocess.PIPE,
 >                            stdout=subprocess.PIPE)
 > +    if rc.returncode == 137:    # Timed Out
 > +        return False
 > +
 >      stdo = rc.communicate()[0]
 >      output = stdo.decode('unicode_escape')
 > -    if rc.returncode == 137:    # Timed Out
 > -        return False
 > -    if len(output.splitlines()) < 2:
 > +    output_lines = output.splitlines()
 > +    # Usually we care about the summary info in the last few lines, reverse.
 > +    output_lines.reverse()
 > +
 > +    global crash_pattern, crash_patterns, crash_string
 > +    if crash_pattern is None: # Initialization
 > +        for line in output_lines:
 > +            for c in crash_patterns:
 > +                if re.search(c, line) is not None:
 > +                    crash_pattern = c
 > +                    crash_string = line
 > +                    print("Identifying crash pattern by this string: ",\
 > +                          crash_string)
 > +                    print("Using regex pattern: ", crash_pattern)
 > +                    return True
 > +        print("Failed to initialize crash pattern: no match.")
 >          return False
 >
 > -    if CRASH_TOKEN is None:
 > -        CRASH_TOKEN = output.splitlines()[-2]
 > +    # First, we search exactly the previous crash string.
 > +    for line in output_lines:
 > +        if crash_string == line:
 > +            return True
 > +
 > +    # Then we decide whether a similar (same pattern) crash happened.
 > +    # Slower now :(
 > +    for line in output_lines:
 > +        if re.search(crash_pattern, line) is not None:
 > +            print("\nINFO: The crash string changed during our minimization process.")
 > +            print("Before: ", crash_string)
 > +            print("After: ", line)
 > +            print("The original regex pattern can still match, updated the crash string.")
 > +            crash_string = line
 > +            return True
 >
 > -    return CRASH_TOKEN in output
 > +    # The input did not trigger (the same type) bug.
 > +    return False
 >
 >
 >  def minimize_trace(inpath, outpath):
 > @@ -66,7 +100,6 @@ def minimize_trace(inpath, outpath):
 >      print("Crashed in {} seconds".format(end-start))
 >      TIMEOUT = (end-start)*5
 >      print("Setting the timeout for {} seconds".format(TIMEOUT))
 > -    print("Identifying Crashes by this string: {}".format(CRASH_TOKEN))
 >
 >      i = 0
 >      newtrace = trace[:]
 > @@ -152,6 +185,6 @@ if __name__ == '__main__':
 >          usage()
 >      # if "accel" not in QEMU_ARGS:
 >      #     QEMU_ARGS += " -accel qtest"
 > -    CRASH_TOKEN = os.getenv("CRASH_TOKEN")
 > +    crash_pattern = os.getenv("CRASH_PATTERN")
 >      QEMU_ARGS += " -qtest stdio -monitor none -serial none "
 >      minimize_trace(sys.argv[1], sys.argv[2])
 > --
 > 2.25.1
Date: Mon, 21 Dec 2020 13:46:40 -0500
Message-ID: <87v9cv3skv.fsf@stormtrooper.vrmnet>
Received-SPF: pass client-ip=128.197.228.104; envelope-from=alxndr@bu.edu;
 helo=relay64.bu.edu
X-Spam_score_int: 8
X-Spam_score: 0.8
X-Spam_bar: /
X-Spam_report: (0.8 / 5.0 requ) BAYES_00=-1.9, BODY_EMPTY=1.999,
 HK_RANDOM_ENVFROM=0.001, HK_RANDOM_FROM=0.001, PYZOR_CHECK=1.392,
 RCVD_IN_DNSWL_LOW=-0.7, SPF_HELO_NONE=0.001,
 SPF_PASS=-0.001 autolearn=no autolearn_force=no
X-Spam_action: no action
X-BeenThere: qemu-devel@nongnu.org
X-Mailman-Version: 2.1.23
Precedence: list
List-Id: <qemu-devel.nongnu.org>
List-Unsubscribe: <https://lists.nongnu.org/mailman/options/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=unsubscribe>
List-Archive: <https://lists.nongnu.org/archive/html/qemu-devel>
List-Post: <mailto:qemu-devel@nongnu.org>
List-Help: <mailto:qemu-devel-request@nongnu.org?subject=help>
List-Subscribe: <https://lists.nongnu.org/mailman/listinfo/qemu-devel>,
 <mailto:qemu-devel-request@nongnu.org?subject=subscribe>
Cc: darren.kenny@oracle.com, bsd@redhat.com, thuth@redhat.com,
 stefanha@redhat.com, pbonzini@redhat.com
Errors-To: qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org
Sender: "Qemu-devel"
 <qemu-devel-bounces+qemu-devel=archiver.kernel.org@nongnu.org>