From: "Nikolai Kondrashov" <Nikolai.Kondrashov@redhat.com>
To: Dmitry Vyukov <dvyukov@google.com>
Cc: "Nikolai Kondrashov" <spbnick@gmail.com>,
	kernelci@groups.io,
	"Guillaume Tucker" <guillaume.tucker@collabora.com>,
	"Philip Li" <philip.li@intel.com>,
	kernelci-members@groups.io, nkondras@redhat.com,
	"Don Zickus" <dzickus@redhat.com>,
	syzkaller <syzkaller@googlegroups.com>,
	"Iñaki Malerba" <imalerba@redhat.com>
Subject: Re: [kernelci-members] Working with the KernelCI project
Date: Thu, 1 Oct 2020 19:34:17 +0300
Message-ID: <9123a571-721b-5c1b-367e-e028acec4096@redhat.com>
In-Reply-To: <CACT4Y+bO_FZsDkrBh2LtxQ+QyxXP5S7e+szO-HYxuj=GyKjUMA@mail.gmail.com>

On 10/1/20 7:00 PM, Dmitry Vyukov wrote:
 > On Thu, Oct 1, 2020 at 5:52 PM Nikolai Kondrashov
 > <Nikolai.Kondrashov@redhat.com> wrote:
 >>
 >> On 10/1/20 6:49 PM, Nikolai Kondrashov wrote:
 >>> On 10/1/20 5:48 PM, Dmitry Vyukov via groups.io wrote:
 >>>   > On Thu, Oct 1, 2020 at 3:32 PM Nikolai Kondrashov
 >>>   > <Nikolai.Kondrashov@redhat.com> wrote:
 >>>   >>
 >>>   >> On 10/1/20 1:48 PM, Nikolai Kondrashov wrote:
 >>>   >>   > Here are the things which could be improved:
 >>>   >>
 >>>   >> Oh, and another thing: could you avoid re-sending the identical
 >>>   >> revisions/builds you've already sent? I.e. send them only once, unless you
 >>>   >> have fields to add? Each of those costs us a row in the database, which is not
 >>>   >> a big deal, but would be good to avoid.
 >>>   >
 >>>   > Do you see that we send dups? I've added logic to not send dups and
 >>>   > from what I see it's working.
 >>>
 >>> Please find attached a zip archive with seven submissions from about six hours
 >>> ago. Each has revisions with the same id and git_commit_hash. Some of those
 >>> revisions have different discovery_time, and that alone shouldn't really be
 >>> the reason to resend. Each of those submissions have a build object, but some
 >>> of them are repeated.
 >>>
 >>> Revision with that git_commit_hash was submitted at least 72 times.
 >>
 >> And now the file is attached :D
 >
 > This is intentional (in the current implementation) and is a
 > consequence of the fact that we always send all 3 entities for each
 > issue/test failure. It's much simpler on our side this way. If we
 > would send a test failure multiple times it would be unintentional.

I see.

 > Identical builds should have the same id, though. And if they have
 > different id's, then these are different builds.

Sure.

 > Revisions were discovered separately by different instances, that's
 > why they have different discovery time. Consider that as if different
 > origin systems would discover it separately. Since discovery time is
 > not an inherent property of the commit itself, there is no way they
 > can agree on it.

Ah, I see now why. Yeah, it's similar to multiple systems submitting the same
revision. Although we don't have systems submitting the same build multiple
times.

 > Will it be better if we don't send discovery time at all?

No, it's OK to submit the discovery time multiple times, but we can't say which
one will be displayed :)
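
To illustrate the ambiguity: when several rows share a revision id but carry
different discovery_time values, something has to pick one. Below is a minimal
sketch of one possible deterministic policy (keep the earliest time). The
function name and the policy are assumptions made for illustration only, not a
description of how our database actually merges rows.

```python
from datetime import datetime


def merge_revisions(rows):
    """Merge duplicate revision rows sharing one id (illustrative only)."""
    merged = {}
    for row in rows:
        for field, value in row.items():
            if field == "discovery_time" and field in merged:
                # Assumed policy: keep the earliest reported discovery time
                merged[field] = min(
                    merged[field], value, key=datetime.fromisoformat
                )
            else:
                # For every other field, the first value seen wins
                merged.setdefault(field, value)
    return merged


rows = [
    {"id": "abc", "discovery_time": "2020-10-01T12:00:00+00:00"},
    {"id": "abc", "discovery_time": "2020-10-01T10:00:00+00:00"},
]
merged = merge_revisions(rows)
```

Without an explicit rule like this, whichever row happens to be read first
decides what gets shown.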

 > Amount of duplication for builds/revisions is capped by the number of
 > bugs we discover, since build/revision is only sent with a bug once.
 > So it's not that it's unlimited.

I think it's OK that you have some duplication, and certainly OK for the
start. Let's get on with sending the data and see how much actual duplication
we get, how it affects performance, and how the de-duplication implementation
goes :)
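
For reference, the "send only once, unless you have fields to add" idea from
earlier in the thread could look roughly like the sketch below on the sending
side. This is only an assumption about how a sender might track it: the
SentTracker class and the "some_new_field" name are hypothetical, and only id,
git_commit_hash and discovery_time come from the submissions discussed here.

```python
class SentTracker:
    """Remembers which fields were already sent per object (hypothetical)."""

    def __init__(self):
        # Maps (kind, id) -> set of field names already submitted
        self._sent = {}

    def filter_new(self, kind, objects):
        """Return only objects that are new or carry fields not sent before."""
        fresh = []
        for obj in objects:
            known = self._sent.setdefault((kind, obj["id"]), set())
            # A changed value of an already-sent field (e.g. discovery_time)
            # is not treated as a reason to resend
            if set(obj) - known:
                fresh.append(obj)
                known |= set(obj)
        return fresh


tracker = SentTracker()
rev = {
    "id": "abc",
    "git_commit_hash": "abc",
    "discovery_time": "2020-10-01T10:00:00+00:00",
}
first = tracker.filter_new("revisions", [rev])
# Same revision, different discovery_time only: skipped
second = tracker.filter_new(
    "revisions", [dict(rev, discovery_time="2020-10-01T11:00:00+00:00")]
)
# Same revision with an additional (hypothetical) field: sent again
third = tracker.filter_new("revisions", [dict(rev, some_new_field="value")])
```

The tracker keys on field names rather than values, so a differing
discovery_time alone does not trigger a resend.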

Thank you!
Nick

