From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-7.2 required=3.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,MENTIONS_GIT_HOSTING,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED, USER_AGENT_SANE_1 autolearn=ham autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id 8A718C74A2B for ; Wed, 10 Jul 2019 16:06:25 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [209.132.180.67]) by mail.kernel.org (Postfix) with ESMTP id 643292087F for ; Wed, 10 Jul 2019 16:06:25 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727325AbfGJQGZ (ORCPT ); Wed, 10 Jul 2019 12:06:25 -0400 Received: from mail.kernel.org ([198.145.29.99]:45132 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726617AbfGJQGZ (ORCPT ); Wed, 10 Jul 2019 12:06:25 -0400 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 588F92064A; Wed, 10 Jul 2019 16:06:23 +0000 (UTC) Date: Wed, 10 Jul 2019 18:06:21 +0200 From: Greg KH To: J Lovejoy Cc: linux-spdx@vger.kernel.org Subject: Re: efficacy of MODULE_LICENSE Message-ID: <20190710160621.GA25564@kroah.com> References: <1B1C3AD5-9222-40A8-A0D2-7BF7CE1181DF@jilayne.com> <20190710093806.GA22916@kroah.com> <789E72F5-FAF6-4E64-8CA8-471EE00BF865@jilayne.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <789E72F5-FAF6-4E64-8CA8-471EE00BF865@jilayne.com> User-Agent: Mutt/1.12.1 (2019-06-15) Sender: linux-spdx-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-spdx@vger.kernel.org On Wed, Jul 10, 2019 at 07:41:03AM -0600, J Lovejoy wrote: > > > > On Jul 10, 2019, at 3:38 AM, Greg KH wrote: > > > > On Tue, Jul 09, 2019 at 10:28:59PM -0600, J Lovejoy wrote: > >> Hi all, > >> > >> We seem to have gone a bit quiet recently! Hopefully that’s just a > >> symptom of nicer weather and holiday season, but we can still pick up > >> some momentum :) > >> > >> I wanted to get your input on the MODULE_LICENSE tag, which I have > >> found to be a bit vexing in some instances. I am finding examples > >> where there is a clearly identifiable license in the file, for example > >> ISC, and then the MODULE_LICENSE tag is something like "Dual BSD/GPL”. > >> There is absolutely no other reference to GPL whatsoever (or any BSD > >> variant for that matter). > > > > MODULE_LICENSE is used by the kernel itself, at runtime, to determine > > the "license" of the module that is being loaded into it. > > > > At that point in time, it is a dual-licensed chunk of code, as it > > incorporated gplv2 bits into it in order to create that module image, > > right? > > by dual-licensed, I’m assuming you mean conjunctive (“AND”) by way of the incorporated GPL-2.0 bits into the ISC-licensed file, then yes, that sounds right. > But this file, by itself, is just ISC for the license as the “incorporating” part only happens at runtime, right? > > so for purposes of identifying the license of the file (and if someone wanted to re-use that file elsewhere), ISC would be the operative license info. > > > >> Based on my understanding of > >> https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/include/linux/module.h#n172 > >> > >> - the MODULE_LICENSE info was never meant to be definitive license > >> info, but seemingly more of an approximation. I’m wondering if others > >> have a different view? > > > > It is used at runtime to determine if the module has access to some > > types of kernel symbols or not. > > > > It can also be used at any time to extract the license from the module > > image on a disk, you can see this by running the 'modinfo' program on > > any kernel module: > > $ modinfo visor | grep license > > license: GPL v2 > > “extract the license” is a bit concerning here - as I take that to > mean the extraction part is the MODULE_LICENSE info, which isn’t > really accurate. It seems pretty accurate for the aggregated work that is the object file that is loaded into the kernel, right? How is it not "accurate"? > or is the intent here not necessarily accuracy, but a threshold > determination that it’s open source/compatible (and not proprietary or > something like that)? That is what we use it for, yes. It really is a yes/no test that we use to allow access to specific internal kernel symbols or not. But, it's also useful for when companies "lie" with that string. Lawyers generally frown when technical people at a company try to do "tricks" and fool the kernel by messing with that string. I know at least 2 VERY major and huge companies[1] that have turned their treatment around for this very reason when they realized what they were "caught" doing, and now are big contributors to our kernel community. So it is a nice tool to be able to use at times :) > >> More specifically - where we have specific license match (like the > >> example above) - we can add the appropriate SPDX identifier, but if we > >> leave the MODULE_LICENSE info, I suspect that scanners will pick that > >> up and report a mix of licensing info (e.g., ISC, BSD, GPL, as in my > >> above example), which kind of brings us to the same place we are now. > >> Should we also remove the MODULE_LICENSE tag where it contradicts the > >> actual license info in terms of an exact license match (i.e., there is > >> nothing to match to GPL here, other than the MODULE_LICENSE tag, but > >> there is an exact match to a different license, ISC, in this case). > > > > MODULE_LICENSE predated SPDX by a decade or so, and was designed to > > solve a totally different use case. I would not try to mix the two, or > > infer one from the other. > > > > MODULE_LICENSE covers the "resulting image" of combining many different > > files that can have different SPDX-identified licenses in them. > > > > Does this help any? > > yes. And I can understand the different use case, I guess my > concern/question is does the existence of MODULE_LICENSE info that > sort of contradicts the actual license info for the file (when looking > just at that file, not the combined/resulting image) frustrate the > goal of having clean licensing info for when people run scans over the > kernel? It shouldn't, which is why we clarified a number of these strings recently with the documentation. What else needs to be adjusted to make this more accurate for what you want to use when scanning a binary image? Remember, that's what this is used for, the binary image, not for the individual files. thanks, greg k-h