From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1030225AbVLFUcd (ORCPT ); Tue, 6 Dec 2005 15:32:33 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1030224AbVLFUcc (ORCPT ); Tue, 6 Dec 2005 15:32:32 -0500 Received: from silver.veritas.com ([143.127.12.111]:59687 "EHLO silver.veritas.com") by vger.kernel.org with ESMTP id S1030210AbVLFUcb (ORCPT ); Tue, 6 Dec 2005 15:32:31 -0500 Date: Tue, 6 Dec 2005 20:31:43 +0000 (GMT) From: Hugh Dickins X-X-Sender: hugh@goblin.wat.veritas.com To: Ryan Richter cc: Linus Torvalds , Kai Makisara , Andrew Morton , James Bottomley , linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org Subject: Re: Fw: crash on x86_64 - mm related? In-Reply-To: <20051206160815.GC11560@tau.solarneutrino.net> Message-ID: References: <20051129092432.0f5742f0.akpm@osdl.org> <20051201195657.GB7236@tau.solarneutrino.net> <20051202180326.GB7634@tau.solarneutrino.net> <20051202194447.GA7679@tau.solarneutrino.net> <20051206160815.GC11560@tau.solarneutrino.net> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII X-OriginalArrivalTime: 06 Dec 2005 20:32:19.0726 (UTC) FILETIME=[27AE92E0:01C5FAA4] Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 6 Dec 2005, Ryan Richter wrote: > Another crash during last night's backup: > > Bad page state at free_hot_cold_page (in process 'taper', page ffff810002410cc0) > flags:0x010000000000000c mapping:ffff81017d5beb70 mapcount:2 count:0 > Bad page state at free_hot_cold_page (in process 'taper', page ffff810002410cc0) > flags:0x010000000000081c mapping:ffff810064cd6910 mapcount:0 count:0 > Kernel BUG at include/linux/mm.h:341 > Pid: 11402, comm: taper Tainted: G B 2.6.14.3 #1 > RIP: 0010:[] {sgl_unmap_user_pages+93} > Kernel BUG at mm/rmap.c:487 > Pid: 11402, comm: taper Tainted: G B 2.6.14.3 #1 > RIP: 0010:[] {page_remove_rmap+39} > Bad page state at prep_new_page (in process 'dumper', page ffff810002410cc0) > flags:0x010000000000001c mapping:0000000000000000 mapcount:-1 count:0 > general protection fault: 0000 [3] SMP > Pid: 1303, comm: kjournald Tainted: G B 2.6.14.3 #1 > RIP: 0010:[] {cache_alloc_refill+330} > NMI Watchdog detected LOCKUP on CPU 1 > Pid: 918, comm: md0_raid5 Tainted: G B 2.6.14.3 #1 > RIP: 0010:[] {.text.lock.spinlock+116} Thanks for the further report. And you had my st.c patch in along with 2.6.14.3, but it still happened, very much like before (except the latter errors, general protection fault onwards - but once we get into using one page for two uses at the same time, anything can go wrong). I've been staring and thinking, but no inspiration yet. Hugh