From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753554AbbDATej (ORCPT ); Wed, 1 Apr 2015 15:34:39 -0400 Received: from g4t3427.houston.hp.com ([15.201.208.55]:42358 "EHLO g4t3427.houston.hp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753086AbbDATeg convert rfc822-to-8bit (ORCPT ); Wed, 1 Apr 2015 15:34:36 -0400 From: "Elliott, Robert (Server Storage)" To: Christoph Hellwig , "linux-nvdimm@ml01.01.org" , "linux-fsdevel@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "x86@kernel.org" CC: "ross.zwisler@linux.intel.com" , "axboe@kernel.dk" , "boaz@plexistor.com" , "Kani, Toshimitsu" Subject: RE: another pmem variant V2 Thread-Topic: another pmem variant V2 Thread-Index: AQHQZ5+JacS38fCB50eh5aSIiWDyPZ04kJiQ Date: Wed, 1 Apr 2015 19:33:38 +0000 Message-ID: <94D0CD8314A33A4D9D801C0FE68B40295A8556BB@G9W0745.americas.hpqcorp.net> References: <1427358764-6126-1-git-send-email-hch@lst.de> In-Reply-To: <1427358764-6126-1-git-send-email-hch@lst.de> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-originating-ip: [16.210.48.26] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org > -----Original Message----- > From: linux-kernel-owner@vger.kernel.org [mailto:linux-kernel- > owner@vger.kernel.org] On Behalf Of Christoph Hellwig > Sent: Thursday, March 26, 2015 3:33 AM > To: linux-nvdimm@ml01.01.org; linux-fsdevel@vger.kernel.org; linux- > kernel@vger.kernel.org; x86@kernel.org > Cc: ross.zwisler@linux.intel.com; axboe@kernel.dk; boaz@plexistor.com > Subject: another pmem variant V2 > I triggered a paging error in the memcpy call for a block read from system-udevd (actually in a modified memcpy() for the cache attribute experiments). 1. This triggered an illegal schedule() call from an atomic context. The call trace is shown below. 2. memcpy() doesn't provide exception handling or error reporting. Some functions like do so, like __copy_user_nocache in arch/x85/lib/copy_user_nocache_64.S. Should pmem only use functions that do so, if available on the architecture? pmem_rw_page can pass along the return value from the copy function. pmem_make_request can report the error, if any, via bio_endio. Call trace ========== [62117.317216] BUG: scheduling while atomic: systemd-udevd/22135/0x00000001 [62117.317232] Modules linked in: pmem ip6table_filter ip6_tables iptable_filter ip_tables ebtable_nat ebtables sg vfat fat x86_pkg_temp_thermal coretemp kvm_intel kvm crc32c_intel ghash_clmulni_intel aesni_intel aes_x86_64 glue_helper lrw gf128mul ablk_helper cryptd xhci_pci hpilo xhci_hcd sb_edac edac_core microcode iTCO_wdt iTCO_vendor_support hpwdt ioatdma shpchp pcspkr lpc_ich mfd_core i2c_i801 wmi pcc_cpufreq dca acpi_cpufreq uinput nfsd auth_rpcgss nfs_acl lockd grace sunrpc xfs exportfs sr_mod cdrom sd_mod bnx2x tg3 ahci mdio libahci ptp pps_core hpsa libcrc32c dm_mirror dm_region_hash dm_log dm_mod ipv6 autofs4 [last unloaded: pmem] [62117.317233] CPU: 31 PID: 22135 Comm: systemd-udevd Tainted: G D 4.0.0-rc6+ #7 [62117.317234] Hardware name: HP ProLiant DL380 Gen9 [62117.317235] ffff88047f3f3ac0 ffff8804241db2e8 ffffffff815a8866 00000000ff86ff86 [62117.317236] ffff8804241dbfd8 ffff8804241db2f8 ffffffff815a4b45 ffff8804241db348 [62117.317237] ffffffff815ab893 ffff880457091050 ffff88047f3fbb20 0000000000000000 [62117.317237] Call Trace: [62117.317240] [] dump_stack+0x45/0x57 [62117.317245] [] __schedule_bug+0x46/0x54 [62117.317247] [] __schedule+0x793/0x870 [62117.317251] [] ? bit_wait+0x50/0x50 [62117.317252] [] schedule+0x37/0x90 [62117.317253] [] schedule_timeout+0x1dc/0x260 [62117.317258] [] ? ktime_get+0x3e/0xa0 [62117.317259] [] io_schedule_timeout+0xac/0x140 [62117.317261] [] bit_wait_io+0x36/0x50 [62117.317262] [] __wait_on_bit_lock+0x4b/0xb0 [62117.317263] [] ? find_get_entries+0xe2/0x130 [62117.317265] [] __lock_page+0xac/0xb0 [62117.317269] [] ? autoremove_wake_function+0x40/0x40 [62117.317276] [] truncate_inode_pages_range+0x3af/0x620 [62117.317278] [] ? cpumask_next_and+0x37/0x50 [62117.317279] [] ? __brelse+0x40/0x40 [62117.317283] [] ? smp_call_function_many+0x5d/0x280 [62117.317284] [] ? free_cpumask_var+0x9/0x10 [62117.317285] [] ? on_each_cpu_cond+0xbd/0x160 [62117.317286] [] ? __brelse+0x40/0x40 [62117.317288] [] truncate_inode_pages+0x15/0x20 [62117.317289] [] kill_bdev+0x33/0x40 [62117.317291] [] __blkdev_put+0x68/0x210 [62117.317293] [] blkdev_put+0x50/0x130 [62117.317294] [] blkdev_close+0x25/0x30 [62117.317296] [] __fput+0xe7/0x220 [62117.317298] [] ____fput+0xe/0x10 [62117.317302] [] task_work_run+0xc4/0xe0 [62117.317306] [] do_exit+0x2d8/0xb10 [62117.317308] [] ? kmsg_dump+0x9c/0xc0 [62117.317312] [] oops_end+0x8e/0xd0 [62117.317313] [] no_context+0x2d4/0x334 [62117.317314] [] __bad_area_nosemaphore+0x6d/0x1c6 [62117.317317] [] ? zone_statistics+0x80/0xa0 [62117.317319] [] bad_area_nosemaphore+0x13/0x15 [62117.317321] [] __do_page_fault+0x91/0x430 [62117.317322] [] do_page_fault+0xc/0x10 [62117.317324] [] page_fault+0x22/0x30 [62117.317325] [] ? pmem_do_bvec.isra.6+0x212/0x3f0 [pmem] [62117.317326] [] pmem_rw_page+0x43/0x60 [pmem] [62117.317328] [] ? __radix_tree_preload+0x38/0xa0 [62117.317329] [] bdev_read_page+0x2e/0x40 [62117.317330] [] do_mpage_readpage+0x51f/0x6c0 [62117.317331] [] ? lru_cache_add+0xe/0x10 [62117.317332] [] mpage_readpages+0xdb/0x130 [62117.317333] [] ? I_BDEV+0x10/0x10 [62117.317334] [] ? I_BDEV+0x10/0x10 [62117.317336] [] blkdev_readpages+0x1d/0x20 [62117.317336] [] __do_page_cache_readahead+0x194/0x210 [62117.317337] [] force_page_cache_readahead+0x75/0xb0 [62117.317338] [] page_cache_sync_readahead+0x43/0x50 [62117.317339] [] generic_file_read_iter+0x431/0x630 [62117.317341] [] blkdev_read_iter+0x37/0x40 [62117.317342] [] new_sync_read+0x7e/0xb0 [62117.317343] [] __vfs_read+0x18/0x50 [62117.317344] [] vfs_read+0x86/0x140 [62117.317345] [] SyS_read+0x46/0xb0 [62117.317346] [] ? __audit_syscall_entry+0xb4/0x110 [62117.317348] [] system_call_fastpath+0x12/0x17 [62121.618505] note: systemd-udevd[22133] exited with preempt_count 1