From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-14.9 required=3.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS,USER_AGENT_SANE_1,USER_IN_DEF_DKIM_WL autolearn=no autolearn_force=no version=3.4.0 Received: from mail.kernel.org (mail.kernel.org [198.145.29.99]) by smtp.lore.kernel.org (Postfix) with ESMTP id C1C70C433E9 for ; Tue, 2 Feb 2021 18:16:33 +0000 (UTC) Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by mail.kernel.org (Postfix) with ESMTP id 9CD04601FB for ; Tue, 2 Feb 2021 18:16:33 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234270AbhBBSQa (ORCPT ); Tue, 2 Feb 2021 13:16:30 -0500 Received: from linux.microsoft.com ([13.77.154.182]:37888 "EHLO linux.microsoft.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233916AbhBBSOv (ORCPT ); Tue, 2 Feb 2021 13:14:51 -0500 Received: from [192.168.1.13] (c-24-16-6-251.hsd1.wa.comcast.net [24.16.6.251]) by linux.microsoft.com (Postfix) with ESMTPSA id 07D6120B7192; Tue, 2 Feb 2021 10:14:09 -0800 (PST) DKIM-Filter: OpenDKIM Filter v2.11.0 linux.microsoft.com 07D6120B7192 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.microsoft.com; s=default; t=1612289650; bh=RBImT/YOk85apb/UQR0i4fPjwoymHl2wNL5VUUqTZk0=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=hm60PHpByjn9NM9Eij+F/7AG52XZhWhr6l9VYtfFHnhmCJtfHRZwGoFeF4HXh8G+K exqHFUNeZgN41QzGSb/qKIxCVzEvti0D3UMObkpCoCMMA0+R0I0hjJvDvGU9Ak1xLQ erYhYTXH7/TFEBj7AThZIYG78siRlVWQ2gjQNIc8= Subject: Re: [RFC] Persist ima logs to disk To: Mimi Zohar , Amir Goldstein Cc: James Bottomley , janne.karhunen@gmail.com, linux-integrity , tusharsu@linux.microsoft.com, tyhicks@linux.microsoft.com, nramas@linux.microsoft.com, balajib@linux.microsoft.com References: <20210105195742.2629-1-raphgi@linux.microsoft.com> <87127d502bcb9707dd4e7a43475ab6bed2fdd421.camel@linux.ibm.com> <715a265180a092754ab9ea8522c39427645b25ad.camel@HansenPartnership.com> <6e28c7a9742131cf508e77448bfee0a03b2c2e5e.camel@linux.ibm.com> <3c50bc449aae2f09bd7d43c401cc9b292f9ec2ae.camel@HansenPartnership.com> <570d54ca679b7a4f786fa65eb78601a2af91c397.camel@linux.ibm.com> <13447f30db609d4bd77d5a826c5102dd5a931a19.camel@HansenPartnership.com> <734adc26-0050-ce8f-4c8c-c8a907b569a6@linux.microsoft.com> <8c78437d0e9a4968996b834030661b6f567f87eb.camel@linux.ibm.com> <3f76d250-6aa7-65c1-b903-5bc82f6f8845@linux.microsoft.com> <3f1571d5b9f2fbb6a8ff9a5fad75b54e2b597904.camel@linux.ibm.com> From: Raphael Gianotti Message-ID: <169af4d7-9f1a-69c9-44a8-9d30deab80cb@linux.microsoft.com> Date: Tue, 2 Feb 2021 10:14:08 -0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:78.0) Gecko/20100101 Thunderbird/78.7.0 MIME-Version: 1.0 In-Reply-To: <3f1571d5b9f2fbb6a8ff9a5fad75b54e2b597904.camel@linux.ibm.com> Content-Type: text/plain; charset=iso-8859-15; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Precedence: bulk List-ID: X-Mailing-List: linux-integrity@vger.kernel.org On 2/2/2021 5:07 AM, Mimi Zohar wrote: > On Tue, 2021-02-02 at 07:54 +0200, Amir Goldstein wrote: >> On Tue, Feb 2, 2021 at 12:53 AM Raphael Gianotti >> wrote: >>> >>> On 1/8/2021 9:58 AM, Raphael Gianotti wrote: >>>> On 1/8/2021 4:38 AM, Mimi Zohar wrote: >>>>> On Thu, 2021-01-07 at 14:57 -0800, Raphael Gianotti wrote: >>>>>>>>>> But this doesn't address where the offloaded measurement list >>>>>>>>>> will be stored, how long the list will be retained, nor who >>>>>>>>>> guarantees the integrity of the offloaded list. In addition, >>>>>>>>>> different form factors will have different requirements. >>>>>> For how long the list would be retained, or in the case of a log >>>>>> segments, it >>>>>> might make sense to have that be an admin decision, something that >>>>>> can be >>>>>> configured to satisfy the needs of a specific system, as mentioned >>>>>> below by >>>>>> James, does that seem correct? >>>>> For the discussion on exporting and truncating the IMA measurement >>>>> list, refer to: >>>>> https://lore.kernel.org/linux-integrity/1580998432.5585.411.camel@linux.ibm.com/ >>>>> >>>>> >>>>>> Given the possibility of keeping the logs around for an indefinite >>>>>> amount of >>>>>> time, would using an expansion of the method present in this RFC be >>>>>> more >>>>>> appropriate than going down the vfs_tmpfile route? Forgive my lack >>>>>> on expertise >>>>>> on mm, but would the vfs_tmpfile approach work for keeping several >>>>>> log segments >>>>>> across multiple kexecs? >>>>> With the "vfs_tmpfile" mechanism, breaking up and saving the log in >>>>> segments isn't needed. The existing mechanism for carrying the >>>>> measurement list across kexec would still be used. Currently, if the >>>>> kernel cannot allocate the memory needed for carrying the measurement >>>>> across kexec, it simply emits an error message, but continues with the >>>>> kexec. >>>> In this change I had introduced "exporting" the log to disk when the size >>>> of the measurement list was too large. Given part of the motivation >>>> behind >>>> moving the measurement list is the possibility of it growing too large >>>> and taking up too much of the kernel memory, that case would likely lead >>>> to kexec not being able to carry over the logs. Do you believe it's >>>> better >>>> to use the "vfs_tmpfile" mechanism for moving the logs to disk and worry >>>> about potential issues with kexec not being able to carry over the logs >>>> separately, given the "vfs_tempfile" approach seems to be preferred and >>>> also simplifies worries regarding truncating the logs? >>> After a chat with Mimi I went ahead and did some investigative >>> work in the vfs_tmpfile approach suggested, and I wanted to >>> restart this thread with some thoughts/questions that came up >>> from that. >>> For the work I did I simply created a tmp file during ima's >>> initialization and then tried to use vm_mmap to map it to memory, >>> with the goal of using that memory mapping to generate return >>> pointers to the code that writes the measurement entries to memory. >> I don't understand why you would want to do that. I might have misunderstood >> the requirements, but this was not how I meant for tmpfile to be used. >> >> Mimi explained to me that currently the IMA measurement list is entirely in >> memory and that you are looking for a way to dump it into a file in order to >> free up memory. >> >> What I suggested is this: >> >> - User opens an O_TMPFILE and passes fd to IMA to start export >> - IMA starts writing (exporting) records to that file using *kernel* write API >> - Every record written to the file is removed from the in-memory list >> - While list is being exported, IMA keeps in-memory count of exported entries >> - In ima_measurements_start, if export file exists, start iterator >> starts reading >> records from the file >> - In ima_measurements_next(), when next iterator reaches the export count, >> it switches over to iterate in-memory list >> >> This process can: >> 1. Continue forever without maintaining any in-memory list >> 2. Work in the background to periodically flush list to file >> 3. Controlled by explicit user commands >> 4. All of the above >> >> Is that understood? Did I understand the requirements correctly? Thanks for the clarification Amir, I never actually saw your initial mails, I apologize for the confusion, the use of mmap was something the original author of the export ima logs to disk mentioned had been suggested, which is why I went down that route. Given the actual suggestion you originally had given, I believe the coding of it is somewhat to the code I sent in the RFC in terms of approach (if we were to have it do periodic flushes, for example). With the addition of reads to the log starting with the file as the oldest logs will be there. I believe the only difference there is whether the list is kept in a tmp file or not, so with the tmp file approach it would be just to keep the list out of memory (either partially or permanently), where with a permanent file, the list would still be available after a cold boot for instance. > Thanks, Amir! > > One of the main requirements we discussed was protecting the O_TMPFILE, > not allowing userspace direct access to the file. > > Mimi