From: Mel Gorman <mgorman@suse.de> To: Peter Zijlstra <a.p.zijlstra@chello.nl>, Rik van Riel <riel@redhat.com> Cc: Srikar Dronamraju <srikar@linux.vnet.ibm.com>, Ingo Molnar <mingo@kernel.org>, Andrea Arcangeli <aarcange@redhat.com>, Johannes Weiner <hannes@cmpxchg.org>, Linux-MM <linux-mm@kvack.org>, LKML <linux-kernel@vger.kernel.org>, Mel Gorman <mgorman@suse.de> Subject: [PATCH 17/63] sched: Set the scan rate proportional to the memory usage of the task being scanned Date: Mon, 7 Oct 2013 11:28:55 +0100 [thread overview] Message-ID: <1381141781-10992-18-git-send-email-mgorman@suse.de> (raw) In-Reply-To: <1381141781-10992-1-git-send-email-mgorman@suse.de> The NUMA PTE scan rate is controlled with a combination of the numa_balancing_scan_period_min, numa_balancing_scan_period_max and numa_balancing_scan_size. This scan rate is independent of the size of the task and as an aside it is further complicated by the fact that numa_balancing_scan_size controls how many pages are marked pte_numa and not how much virtual memory is scanned. In combination, it is almost impossible to meaningfully tune the min and max scan periods and reasoning about performance is complex when the time to complete a full scan is is partially a function of the tasks memory size. This patch alters the semantic of the min and max tunables to be about tuning the length time it takes to complete a scan of a tasks occupied virtual address space. Conceptually this is a lot easier to understand. There is a "sanity" check to ensure the scan rate is never extremely fast based on the amount of virtual memory that should be scanned in a second. The default of 2.5G seems arbitrary but it is to have the maximum scan rate after the patch roughly match the maximum scan rate before the patch was applied. On a similar note, numa_scan_period is in milliseconds and not jiffies. Properly placed pages slow the scanning rate but adding 10 jiffies to numa_scan_period means that the rate scanning slows depends on HZ which is confusing. Get rid of the jiffies_to_msec conversion and treat it as ms. Signed-off-by: Mel Gorman <mgorman@suse.de> --- Documentation/sysctl/kernel.txt | 11 +++--- include/linux/sched.h | 1 + kernel/sched/fair.c | 88 +++++++++++++++++++++++++++++++++++------ 3 files changed, 83 insertions(+), 17 deletions(-) diff --git a/Documentation/sysctl/kernel.txt b/Documentation/sysctl/kernel.txt index 1428c66..8cd7e5f 100644 --- a/Documentation/sysctl/kernel.txt +++ b/Documentation/sysctl/kernel.txt @@ -403,15 +403,16 @@ workload pattern changes and minimises performance impact due to remote memory accesses. These sysctls control the thresholds for scan delays and the number of pages scanned. -numa_balancing_scan_period_min_ms is the minimum delay in milliseconds -between scans. It effectively controls the maximum scanning rate for -each task. +numa_balancing_scan_period_min_ms is the minimum time in milliseconds to +scan a tasks virtual memory. It effectively controls the maximum scanning +rate for each task. numa_balancing_scan_delay_ms is the starting "scan delay" used for a task when it initially forks. -numa_balancing_scan_period_max_ms is the maximum delay between scans. It -effectively controls the minimum scanning rate for each task. +numa_balancing_scan_period_max_ms is the maximum time in milliseconds to +scan a tasks virtual memory. It effectively controls the minimum scanning +rate for each task. numa_balancing_scan_size_mb is how many megabytes worth of pages are scanned for a given scan. diff --git a/include/linux/sched.h b/include/linux/sched.h index 5308d89..a8095ad 100644 --- a/include/linux/sched.h +++ b/include/linux/sched.h @@ -1329,6 +1329,7 @@ struct task_struct { int numa_scan_seq; int numa_migrate_seq; unsigned int numa_scan_period; + unsigned int numa_scan_period_max; u64 node_stamp; /* migration stamp */ struct callback_head numa_work; #endif /* CONFIG_NUMA_BALANCING */ diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 22c0c7c..c0092e5 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -818,11 +818,13 @@ update_stats_curr_start(struct cfs_rq *cfs_rq, struct sched_entity *se) #ifdef CONFIG_NUMA_BALANCING /* - * numa task sample period in ms + * Approximate time to scan a full NUMA task in ms. The task scan period is + * calculated based on the tasks virtual memory size and + * numa_balancing_scan_size. */ -unsigned int sysctl_numa_balancing_scan_period_min = 100; -unsigned int sysctl_numa_balancing_scan_period_max = 100*50; -unsigned int sysctl_numa_balancing_scan_period_reset = 100*600; +unsigned int sysctl_numa_balancing_scan_period_min = 1000; +unsigned int sysctl_numa_balancing_scan_period_max = 60000; +unsigned int sysctl_numa_balancing_scan_period_reset = 60000; /* Portion of address space to scan in MB */ unsigned int sysctl_numa_balancing_scan_size = 256; @@ -830,6 +832,51 @@ unsigned int sysctl_numa_balancing_scan_size = 256; /* Scan @scan_size MB every @scan_period after an initial @scan_delay in ms */ unsigned int sysctl_numa_balancing_scan_delay = 1000; +static unsigned int task_nr_scan_windows(struct task_struct *p) +{ + unsigned long rss = 0; + unsigned long nr_scan_pages; + + /* + * Calculations based on RSS as non-present and empty pages are skipped + * by the PTE scanner and NUMA hinting faults should be trapped based + * on resident pages + */ + nr_scan_pages = sysctl_numa_balancing_scan_size << (20 - PAGE_SHIFT); + rss = get_mm_rss(p->mm); + if (!rss) + rss = nr_scan_pages; + + rss = round_up(rss, nr_scan_pages); + return rss / nr_scan_pages; +} + +/* For sanitys sake, never scan more PTEs than MAX_SCAN_WINDOW MB/sec. */ +#define MAX_SCAN_WINDOW 2560 + +static unsigned int task_scan_min(struct task_struct *p) +{ + unsigned int scan, floor; + unsigned int windows = 1; + + if (sysctl_numa_balancing_scan_size < MAX_SCAN_WINDOW) + windows = MAX_SCAN_WINDOW / sysctl_numa_balancing_scan_size; + floor = 1000 / windows; + + scan = sysctl_numa_balancing_scan_period_min / task_nr_scan_windows(p); + return max_t(unsigned int, floor, scan); +} + +static unsigned int task_scan_max(struct task_struct *p) +{ + unsigned int smin = task_scan_min(p); + unsigned int smax; + + /* Watch for min being lower than max due to floor calculations */ + smax = sysctl_numa_balancing_scan_period_max / task_nr_scan_windows(p); + return max(smin, smax); +} + static void task_numa_placement(struct task_struct *p) { int seq; @@ -840,6 +887,7 @@ static void task_numa_placement(struct task_struct *p) if (p->numa_scan_seq == seq) return; p->numa_scan_seq = seq; + p->numa_scan_period_max = task_scan_max(p); /* FIXME: Scheduling placement policy hints go here */ } @@ -860,9 +908,14 @@ void task_numa_fault(int node, int pages, bool migrated) * If pages are properly placed (did not migrate) then scan slower. * This is reset periodically in case of phase changes */ - if (!migrated) - p->numa_scan_period = min(sysctl_numa_balancing_scan_period_max, - p->numa_scan_period + jiffies_to_msecs(10)); + if (!migrated) { + /* Initialise if necessary */ + if (!p->numa_scan_period_max) + p->numa_scan_period_max = task_scan_max(p); + + p->numa_scan_period = min(p->numa_scan_period_max, + p->numa_scan_period + 10); + } task_numa_placement(p); } @@ -884,6 +937,7 @@ void task_numa_work(struct callback_head *work) struct mm_struct *mm = p->mm; struct vm_area_struct *vma; unsigned long start, end; + unsigned long nr_pte_updates = 0; long pages; WARN_ON_ONCE(p != container_of(work, struct task_struct, numa_work)); @@ -915,7 +969,7 @@ void task_numa_work(struct callback_head *work) */ migrate = mm->numa_next_reset; if (time_after(now, migrate)) { - p->numa_scan_period = sysctl_numa_balancing_scan_period_min; + p->numa_scan_period = task_scan_min(p); next_scan = now + msecs_to_jiffies(sysctl_numa_balancing_scan_period_reset); xchg(&mm->numa_next_reset, next_scan); } @@ -927,8 +981,10 @@ void task_numa_work(struct callback_head *work) if (time_before(now, migrate)) return; - if (p->numa_scan_period == 0) - p->numa_scan_period = sysctl_numa_balancing_scan_period_min; + if (p->numa_scan_period == 0) { + p->numa_scan_period_max = task_scan_max(p); + p->numa_scan_period = task_scan_min(p); + } next_scan = now + msecs_to_jiffies(p->numa_scan_period); if (cmpxchg(&mm->numa_next_scan, migrate, next_scan) != migrate) @@ -965,7 +1021,15 @@ void task_numa_work(struct callback_head *work) start = max(start, vma->vm_start); end = ALIGN(start + (pages << PAGE_SHIFT), HPAGE_SIZE); end = min(end, vma->vm_end); - pages -= change_prot_numa(vma, start, end); + nr_pte_updates += change_prot_numa(vma, start, end); + + /* + * Scan sysctl_numa_balancing_scan_size but ensure that + * at least one PTE is updated so that unused virtual + * address space is quickly skipped. + */ + if (nr_pte_updates) + pages -= (end - start) >> PAGE_SHIFT; start = end; if (pages <= 0) @@ -1012,7 +1076,7 @@ void task_tick_numa(struct rq *rq, struct task_struct *curr) if (now - curr->node_stamp > period) { if (!curr->node_stamp) - curr->numa_scan_period = sysctl_numa_balancing_scan_period_min; + curr->numa_scan_period = task_scan_min(curr); curr->node_stamp += period; if (!time_before(jiffies, curr->mm->numa_next_scan)) { -- 1.8.4
WARNING: multiple messages have this Message-ID (diff)
From: Mel Gorman <mgorman@suse.de> To: Peter Zijlstra <a.p.zijlstra@chello.nl>, Rik van Riel <riel@redhat.com> Cc: Srikar Dronamraju <srikar@linux.vnet.ibm.com>, Ingo Molnar <mingo@kernel.org>, Andrea Arcangeli <aarcange@redhat.com>, Johannes Weiner <hannes@cmpxchg.org>, Linux-MM <linux-mm@kvack.org>, LKML <linux-kernel@vger.kernel.org>, Mel Gorman <mgorman@suse.de> Subject: [PATCH 17/63] sched: Set the scan rate proportional to the memory usage of the task being scanned Date: Mon, 7 Oct 2013 11:28:55 +0100 [thread overview] Message-ID: <1381141781-10992-18-git-send-email-mgorman@suse.de> (raw) In-Reply-To: <1381141781-10992-1-git-send-email-mgorman@suse.de> The NUMA PTE scan rate is controlled with a combination of the numa_balancing_scan_period_min, numa_balancing_scan_period_max and numa_balancing_scan_size. This scan rate is independent of the size of the task and as an aside it is further complicated by the fact that numa_balancing_scan_size controls how many pages are marked pte_numa and not how much virtual memory is scanned. In combination, it is almost impossible to meaningfully tune the min and max scan periods and reasoning about performance is complex when the time to complete a full scan is is partially a function of the tasks memory size. This patch alters the semantic of the min and max tunables to be about tuning the length time it takes to complete a scan of a tasks occupied virtual address space. Conceptually this is a lot easier to understand. There is a "sanity" check to ensure the scan rate is never extremely fast based on the amount of virtual memory that should be scanned in a second. The default of 2.5G seems arbitrary but it is to have the maximum scan rate after the patch roughly match the maximum scan rate before the patch was applied. On a similar note, numa_scan_period is in milliseconds and not jiffies. Properly placed pages slow the scanning rate but adding 10 jiffies to numa_scan_period means that the rate scanning slows depends on HZ which is confusing. Get rid of the jiffies_to_msec conversion and treat it as ms. Signed-off-by: Mel Gorman <mgorman@suse.de> --- Documentation/sysctl/kernel.txt | 11 +++--- include/linux/sched.h | 1 + kernel/sched/fair.c | 88 +++++++++++++++++++++++++++++++++++------ 3 files changed, 83 insertions(+), 17 deletions(-) diff --git a/Documentation/sysctl/kernel.txt b/Documentation/sysctl/kernel.txt index 1428c66..8cd7e5f 100644 --- a/Documentation/sysctl/kernel.txt +++ b/Documentation/sysctl/kernel.txt @@ -403,15 +403,16 @@ workload pattern changes and minimises performance impact due to remote memory accesses. These sysctls control the thresholds for scan delays and the number of pages scanned. -numa_balancing_scan_period_min_ms is the minimum delay in milliseconds -between scans. It effectively controls the maximum scanning rate for -each task. +numa_balancing_scan_period_min_ms is the minimum time in milliseconds to +scan a tasks virtual memory. It effectively controls the maximum scanning +rate for each task. numa_balancing_scan_delay_ms is the starting "scan delay" used for a task when it initially forks. -numa_balancing_scan_period_max_ms is the maximum delay between scans. It -effectively controls the minimum scanning rate for each task. +numa_balancing_scan_period_max_ms is the maximum time in milliseconds to +scan a tasks virtual memory. It effectively controls the minimum scanning +rate for each task. numa_balancing_scan_size_mb is how many megabytes worth of pages are scanned for a given scan. diff --git a/include/linux/sched.h b/include/linux/sched.h index 5308d89..a8095ad 100644 --- a/include/linux/sched.h +++ b/include/linux/sched.h @@ -1329,6 +1329,7 @@ struct task_struct { int numa_scan_seq; int numa_migrate_seq; unsigned int numa_scan_period; + unsigned int numa_scan_period_max; u64 node_stamp; /* migration stamp */ struct callback_head numa_work; #endif /* CONFIG_NUMA_BALANCING */ diff --git a/kernel/sched/fair.c b/kernel/sched/fair.c index 22c0c7c..c0092e5 100644 --- a/kernel/sched/fair.c +++ b/kernel/sched/fair.c @@ -818,11 +818,13 @@ update_stats_curr_start(struct cfs_rq *cfs_rq, struct sched_entity *se) #ifdef CONFIG_NUMA_BALANCING /* - * numa task sample period in ms + * Approximate time to scan a full NUMA task in ms. The task scan period is + * calculated based on the tasks virtual memory size and + * numa_balancing_scan_size. */ -unsigned int sysctl_numa_balancing_scan_period_min = 100; -unsigned int sysctl_numa_balancing_scan_period_max = 100*50; -unsigned int sysctl_numa_balancing_scan_period_reset = 100*600; +unsigned int sysctl_numa_balancing_scan_period_min = 1000; +unsigned int sysctl_numa_balancing_scan_period_max = 60000; +unsigned int sysctl_numa_balancing_scan_period_reset = 60000; /* Portion of address space to scan in MB */ unsigned int sysctl_numa_balancing_scan_size = 256; @@ -830,6 +832,51 @@ unsigned int sysctl_numa_balancing_scan_size = 256; /* Scan @scan_size MB every @scan_period after an initial @scan_delay in ms */ unsigned int sysctl_numa_balancing_scan_delay = 1000; +static unsigned int task_nr_scan_windows(struct task_struct *p) +{ + unsigned long rss = 0; + unsigned long nr_scan_pages; + + /* + * Calculations based on RSS as non-present and empty pages are skipped + * by the PTE scanner and NUMA hinting faults should be trapped based + * on resident pages + */ + nr_scan_pages = sysctl_numa_balancing_scan_size << (20 - PAGE_SHIFT); + rss = get_mm_rss(p->mm); + if (!rss) + rss = nr_scan_pages; + + rss = round_up(rss, nr_scan_pages); + return rss / nr_scan_pages; +} + +/* For sanitys sake, never scan more PTEs than MAX_SCAN_WINDOW MB/sec. */ +#define MAX_SCAN_WINDOW 2560 + +static unsigned int task_scan_min(struct task_struct *p) +{ + unsigned int scan, floor; + unsigned int windows = 1; + + if (sysctl_numa_balancing_scan_size < MAX_SCAN_WINDOW) + windows = MAX_SCAN_WINDOW / sysctl_numa_balancing_scan_size; + floor = 1000 / windows; + + scan = sysctl_numa_balancing_scan_period_min / task_nr_scan_windows(p); + return max_t(unsigned int, floor, scan); +} + +static unsigned int task_scan_max(struct task_struct *p) +{ + unsigned int smin = task_scan_min(p); + unsigned int smax; + + /* Watch for min being lower than max due to floor calculations */ + smax = sysctl_numa_balancing_scan_period_max / task_nr_scan_windows(p); + return max(smin, smax); +} + static void task_numa_placement(struct task_struct *p) { int seq; @@ -840,6 +887,7 @@ static void task_numa_placement(struct task_struct *p) if (p->numa_scan_seq == seq) return; p->numa_scan_seq = seq; + p->numa_scan_period_max = task_scan_max(p); /* FIXME: Scheduling placement policy hints go here */ } @@ -860,9 +908,14 @@ void task_numa_fault(int node, int pages, bool migrated) * If pages are properly placed (did not migrate) then scan slower. * This is reset periodically in case of phase changes */ - if (!migrated) - p->numa_scan_period = min(sysctl_numa_balancing_scan_period_max, - p->numa_scan_period + jiffies_to_msecs(10)); + if (!migrated) { + /* Initialise if necessary */ + if (!p->numa_scan_period_max) + p->numa_scan_period_max = task_scan_max(p); + + p->numa_scan_period = min(p->numa_scan_period_max, + p->numa_scan_period + 10); + } task_numa_placement(p); } @@ -884,6 +937,7 @@ void task_numa_work(struct callback_head *work) struct mm_struct *mm = p->mm; struct vm_area_struct *vma; unsigned long start, end; + unsigned long nr_pte_updates = 0; long pages; WARN_ON_ONCE(p != container_of(work, struct task_struct, numa_work)); @@ -915,7 +969,7 @@ void task_numa_work(struct callback_head *work) */ migrate = mm->numa_next_reset; if (time_after(now, migrate)) { - p->numa_scan_period = sysctl_numa_balancing_scan_period_min; + p->numa_scan_period = task_scan_min(p); next_scan = now + msecs_to_jiffies(sysctl_numa_balancing_scan_period_reset); xchg(&mm->numa_next_reset, next_scan); } @@ -927,8 +981,10 @@ void task_numa_work(struct callback_head *work) if (time_before(now, migrate)) return; - if (p->numa_scan_period == 0) - p->numa_scan_period = sysctl_numa_balancing_scan_period_min; + if (p->numa_scan_period == 0) { + p->numa_scan_period_max = task_scan_max(p); + p->numa_scan_period = task_scan_min(p); + } next_scan = now + msecs_to_jiffies(p->numa_scan_period); if (cmpxchg(&mm->numa_next_scan, migrate, next_scan) != migrate) @@ -965,7 +1021,15 @@ void task_numa_work(struct callback_head *work) start = max(start, vma->vm_start); end = ALIGN(start + (pages << PAGE_SHIFT), HPAGE_SIZE); end = min(end, vma->vm_end); - pages -= change_prot_numa(vma, start, end); + nr_pte_updates += change_prot_numa(vma, start, end); + + /* + * Scan sysctl_numa_balancing_scan_size but ensure that + * at least one PTE is updated so that unused virtual + * address space is quickly skipped. + */ + if (nr_pte_updates) + pages -= (end - start) >> PAGE_SHIFT; start = end; if (pages <= 0) @@ -1012,7 +1076,7 @@ void task_tick_numa(struct rq *rq, struct task_struct *curr) if (now - curr->node_stamp > period) { if (!curr->node_stamp) - curr->numa_scan_period = sysctl_numa_balancing_scan_period_min; + curr->numa_scan_period = task_scan_min(curr); curr->node_stamp += period; if (!time_before(jiffies, curr->mm->numa_next_scan)) { -- 1.8.4 -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@kvack.org. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
next prev parent reply other threads:[~2013-10-07 10:42 UTC|newest] Thread overview: 340+ messages / expand[flat|nested] mbox.gz Atom feed top 2013-10-07 10:28 [PATCH 0/63] Basic scheduler support for automatic NUMA balancing V9 Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 10:28 ` [PATCH 01/63] hotplug: Optimize {get,put}_online_cpus() Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 10:28 ` [PATCH 02/63] mm: numa: Document automatic NUMA balancing sysctls Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 12:46 ` Rik van Riel 2013-10-07 12:46 ` Rik van Riel 2013-10-09 17:24 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 03/63] sched, numa: Comment fixlets Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 12:46 ` Rik van Riel 2013-10-07 12:46 ` Rik van Riel 2013-10-09 17:24 ` [tip:sched/core] sched/numa: Fix comments tip-bot for Peter Zijlstra 2013-10-07 10:28 ` [PATCH 04/63] mm: numa: Do not account for a hinting fault if we raced Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 12:47 ` Rik van Riel 2013-10-07 12:47 ` Rik van Riel 2013-10-09 17:24 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-29 10:42 ` [tip:core/urgent] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 05/63] mm: Wait for THP migrations to complete during NUMA hinting faults Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 13:55 ` Rik van Riel 2013-10-07 13:55 ` Rik van Riel 2013-10-09 17:24 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-29 10:42 ` [tip:core/urgent] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 06/63] mm: Prevent parallel splits during THP migration Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 14:01 ` Rik van Riel 2013-10-07 14:01 ` Rik van Riel 2013-10-09 17:24 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-29 10:42 ` [tip:core/urgent] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 07/63] mm: numa: Sanitize task_numa_fault() callsites Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 14:02 ` Rik van Riel 2013-10-07 14:02 ` Rik van Riel 2013-10-09 17:25 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-29 10:42 ` [tip:core/urgent] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 08/63] mm: Close races between THP migration and PMD numa clearing Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 14:02 ` Rik van Riel 2013-10-07 14:02 ` Rik van Riel 2013-10-09 17:25 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-29 10:42 ` [tip:core/urgent] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 09/63] mm: Account for a THP NUMA hinting update as one PTE update Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 14:02 ` Rik van Riel 2013-10-07 14:02 ` Rik van Riel 2013-10-09 17:25 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-29 10:43 ` [tip:core/urgent] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 10/63] mm: Do not flush TLB during protection change if !pte_present && !migration_entry Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 15:12 ` Rik van Riel 2013-10-07 15:12 ` Rik van Riel 2013-10-09 17:25 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 11/63] mm: Only flush TLBs if a transhuge PMD is modified for NUMA pte scanning Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-09 17:25 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 12/63] mm: numa: Do not migrate or account for hinting faults on the zero page Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 17:10 ` Rik van Riel 2013-10-07 17:10 ` Rik van Riel 2013-10-09 17:25 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 13/63] sched: numa: Mitigate chance that same task always updates PTEs Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 17:24 ` Rik van Riel 2013-10-07 17:24 ` Rik van Riel 2013-10-09 17:26 ` [tip:sched/core] sched/numa: " tip-bot for Peter Zijlstra 2013-10-07 10:28 ` [PATCH 14/63] sched: numa: Continue PTE scanning even if migrate rate limited Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 17:24 ` Rik van Riel 2013-10-07 17:24 ` Rik van Riel 2013-10-09 17:26 ` [tip:sched/core] sched/numa: " tip-bot for Peter Zijlstra 2013-10-07 10:28 ` [PATCH 15/63] Revert "mm: sched: numa: Delay PTE scanning until a task is scheduled on a new node" Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 17:42 ` Rik van Riel 2013-10-07 17:42 ` Rik van Riel 2013-10-09 17:26 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 16/63] sched: numa: Initialise numa_next_scan properly Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 17:44 ` Rik van Riel 2013-10-07 17:44 ` Rik van Riel 2013-10-09 17:26 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:28 ` Mel Gorman [this message] 2013-10-07 10:28 ` [PATCH 17/63] sched: Set the scan rate proportional to the memory usage of the task being scanned Mel Gorman 2013-10-07 17:44 ` Rik van Riel 2013-10-07 17:44 ` Rik van Riel 2013-10-09 17:26 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 18/63] sched: numa: Slow scan rate if no NUMA hinting faults are being recorded Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 18:02 ` Rik van Riel 2013-10-07 18:02 ` Rik van Riel 2013-10-09 17:26 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 19/63] sched: Track NUMA hinting faults on per-node basis Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 18:02 ` Rik van Riel 2013-10-07 18:02 ` Rik van Riel 2013-10-09 17:27 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-12-04 5:32 ` [PATCH 19/63] sched: " Wanpeng Li 2013-12-04 5:37 ` Wanpeng Li 2013-10-07 10:28 ` [PATCH 20/63] sched: Select a preferred node with the most numa hinting faults Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 18:04 ` Rik van Riel 2013-10-07 18:04 ` Rik van Riel 2013-10-09 17:27 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:28 ` [PATCH 21/63] sched: Update NUMA hinting faults once per scan Mel Gorman 2013-10-07 10:28 ` Mel Gorman 2013-10-07 18:39 ` Rik van Riel 2013-10-07 18:39 ` Rik van Riel 2013-10-09 17:27 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 22/63] sched: Favour moving tasks towards the preferred node Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:39 ` Rik van Riel 2013-10-07 18:39 ` Rik van Riel 2013-10-09 17:27 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 23/63] sched: Resist moving tasks towards nodes with fewer hinting faults Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:40 ` Rik van Riel 2013-10-07 18:40 ` Rik van Riel 2013-10-09 17:27 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 24/63] sched: Reschedule task on preferred NUMA node once selected Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:40 ` Rik van Riel 2013-10-07 18:40 ` Rik van Riel 2013-10-09 17:27 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 25/63] sched: Add infrastructure for split shared/private accounting of NUMA hinting faults Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:41 ` Rik van Riel 2013-10-07 18:41 ` Rik van Riel 2013-10-09 17:28 ` [tip:sched/core] sched/numa: Add infrastructure for split shared/ private " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 26/63] sched: Check current->mm before allocating NUMA faults Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:41 ` Rik van Riel 2013-10-07 18:41 ` Rik van Riel 2013-10-09 17:28 ` [tip:sched/core] sched/numa: Check current-> mm " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 27/63] mm: numa: Scan pages with elevated page_mapcount Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:43 ` Rik van Riel 2013-10-07 18:43 ` Rik van Riel 2013-10-09 17:28 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 28/63] sched: Remove check that skips small VMAs Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:44 ` Rik van Riel 2013-10-07 18:44 ` Rik van Riel 2013-10-09 17:28 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 29/63] sched: Set preferred NUMA node based on number of private faults Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:45 ` Rik van Riel 2013-10-07 18:45 ` Rik van Riel 2013-10-09 17:28 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 30/63] sched: Do not migrate memory immediately after switching node Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:28 ` [tip:sched/core] sched/numa: " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 31/63] mm: numa: only unmap migrate-on-fault VMAs Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:29 ` [tip:sched/core] mm: numa: Limit NUMA scanning to " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 32/63] sched: Avoid overloading CPUs on a preferred NUMA node Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:58 ` Rik van Riel 2013-10-07 18:58 ` Rik van Riel 2013-10-09 17:29 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 33/63] sched: Retry migration of tasks to CPU on a preferred node Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 18:58 ` Rik van Riel 2013-10-07 18:58 ` Rik van Riel 2013-10-09 17:29 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 34/63] sched: numa: increment numa_migrate_seq when task runs in correct location Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:29 ` [tip:sched/core] sched/numa: Increment " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 35/63] sched: numa: Do not trap hinting faults for shared libraries Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:04 ` Rik van Riel 2013-10-07 19:04 ` Rik van Riel 2013-10-09 17:29 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 36/63] mm: numa: Only trap pmd hinting faults if we would otherwise trap PTE faults Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:06 ` Rik van Riel 2013-10-07 19:06 ` Rik van Riel 2013-10-09 17:29 ` [tip:sched/core] mm: numa: Trap pmd hinting faults only " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 37/63] stop_machine: Introduce stop_two_cpus() Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:30 ` [tip:sched/core] " tip-bot for Peter Zijlstra 2013-10-07 10:29 ` [PATCH 38/63] sched: Introduce migrate_swap() Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:06 ` Rik van Riel 2013-10-07 19:06 ` Rik van Riel 2013-10-09 17:30 ` [tip:sched/core] sched/numa: " tip-bot for Peter Zijlstra 2013-10-10 18:17 ` Peter Zijlstra 2013-10-10 19:04 ` Rik van Riel 2013-10-15 9:55 ` Mel Gorman 2013-10-17 16:49 ` [tip:sched/core] sched: Fix race in migrate_swap_stop() tip-bot for Peter Zijlstra 2013-10-07 10:29 ` [PATCH 39/63] sched: numa: Use a system-wide search to find swap/migration candidates Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:07 ` Rik van Riel 2013-10-07 19:07 ` Rik van Riel 2013-10-09 17:30 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 40/63] sched: numa: Favor placing a task on the preferred node Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:07 ` Rik van Riel 2013-10-07 19:07 ` Rik van Riel 2013-10-09 17:30 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 41/63] sched: numa: fix placement of workloads spread across multiple nodes Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:30 ` [tip:sched/core] sched/numa: Fix " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 42/63] mm: numa: Change page last {nid,pid} into {cpu,pid} Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:08 ` Rik van Riel 2013-10-07 19:08 ` Rik van Riel 2013-10-09 17:30 ` [tip:sched/core] mm: numa: Change page last {nid,pid} into {cpu, pid} tip-bot for Peter Zijlstra 2013-10-07 10:29 ` [PATCH 43/63] sched: numa: Use {cpu, pid} to create task groups for shared faults Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:09 ` Rik van Riel 2013-10-07 19:09 ` Rik van Riel 2013-10-09 17:31 ` [tip:sched/core] sched/numa: " tip-bot for Peter Zijlstra 2013-10-07 10:29 ` [PATCH 44/63] sched: numa: Report a NUMA task group ID Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:09 ` Rik van Riel 2013-10-07 19:09 ` Rik van Riel 2013-10-09 17:31 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 45/63] mm: numa: copy cpupid on page migration Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:31 ` [tip:sched/core] mm: numa: Copy " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 46/63] mm: numa: Do not group on RO pages Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:10 ` Rik van Riel 2013-10-07 19:10 ` Rik van Riel 2013-10-09 17:31 ` [tip:sched/core] " tip-bot for Peter Zijlstra 2013-10-07 10:29 ` [PATCH 47/63] mm: numa: Do not batch handle PMD pages Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:11 ` Rik van Riel 2013-10-07 19:11 ` Rik van Riel 2013-10-09 17:31 ` [tip:sched/core] " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 48/63] sched: numa: stay on the same node if CLONE_VM Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:31 ` [tip:sched/core] sched/numa: Stay " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 49/63] sched: numa: use group fault statistics in numa placement Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:32 ` [tip:sched/core] sched/numa: Use " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 50/63] sched: numa: call task_numa_free from do_execve Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:32 ` [tip:sched/core] sched/numa: Call task_numa_free() from do_execve () tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 51/63] sched: numa: Prevent parallel updates to group stats during placement Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:13 ` Rik van Riel 2013-10-07 19:13 ` Rik van Riel 2013-10-09 17:32 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 52/63] sched: numa: add debugging Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:13 ` Rik van Riel 2013-10-07 19:13 ` Rik van Riel 2013-10-09 17:32 ` [tip:sched/core] sched/numa: Add debugging tip-bot for Ingo Molnar 2013-10-07 10:29 ` [PATCH 53/63] sched: numa: Decide whether to favour task or group weights based on swap candidate relationships Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:32 ` [tip:sched/core] sched/numa: " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 54/63] sched: numa: fix task or group comparison Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:32 ` [tip:sched/core] sched/numa: Fix " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 55/63] sched: numa: Avoid migrating tasks that are placed on their preferred node Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:14 ` Rik van Riel 2013-10-07 19:14 ` Rik van Riel 2013-10-09 17:33 ` [tip:sched/core] sched/numa: " tip-bot for Peter Zijlstra 2013-10-07 10:29 ` [PATCH 56/63] sched: numa: be more careful about joining numa groups Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:33 ` [tip:sched/core] sched/numa: Be " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 57/63] sched: numa: Take false sharing into account when adapting scan rate Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:14 ` Rik van Riel 2013-10-07 19:14 ` Rik van Riel 2013-10-09 17:33 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 58/63] sched: numa: adjust scan rate in task_numa_placement Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:33 ` [tip:sched/core] sched/numa: Adjust " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 59/63] sched: numa: Remove the numa_balancing_scan_period_reset sysctl Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:14 ` Rik van Riel 2013-10-07 19:14 ` Rik van Riel 2013-10-09 17:33 ` [tip:sched/core] sched/numa: " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 60/63] mm: numa: revert temporarily disabling of NUMA migration Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:33 ` [tip:sched/core] mm: numa: Revert " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 61/63] sched: numa: skip some page migrations after a shared fault Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:34 ` [tip:sched/core] sched/numa: Skip " tip-bot for Rik van Riel 2013-10-07 10:29 ` [PATCH 62/63] sched: numa: use unsigned longs for numa group fault stats Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-07 19:15 ` Rik van Riel 2013-10-07 19:15 ` Rik van Riel 2013-10-09 17:34 ` [tip:sched/core] sched/numa: Use " tip-bot for Mel Gorman 2013-10-07 10:29 ` [PATCH 63/63] sched: numa: periodically retry task_numa_migrate Mel Gorman 2013-10-07 10:29 ` Mel Gorman 2013-10-09 17:34 ` [tip:sched/core] sched/numa: Retry task_numa_migrate() periodically tip-bot for Rik van Riel 2013-10-09 11:03 ` [PATCH 0/63] Basic scheduler support for automatic NUMA balancing V9 Ingo Molnar 2013-10-09 11:03 ` Ingo Molnar 2013-10-09 11:11 ` Ingo Molnar 2013-10-09 11:11 ` Ingo Molnar 2013-10-09 11:13 ` Ingo Molnar 2013-10-09 11:13 ` Ingo Molnar 2013-10-09 12:05 ` Peter Zijlstra 2013-10-09 12:05 ` Peter Zijlstra 2013-10-09 12:48 ` Ingo Molnar 2013-10-09 12:48 ` Ingo Molnar 2013-10-10 7:05 ` Mel Gorman 2013-10-10 7:05 ` Mel Gorman 2013-10-09 16:28 ` Ingo Molnar 2013-10-09 16:29 ` Ingo Molnar 2013-10-09 16:57 ` Ingo Molnar 2013-10-09 16:57 ` Ingo Molnar 2013-10-09 17:09 ` Ingo Molnar 2013-10-09 17:09 ` Ingo Molnar 2013-10-09 17:11 ` Peter Zijlstra 2013-10-09 17:11 ` Peter Zijlstra 2013-10-09 17:08 ` Peter Zijlstra 2013-10-09 17:08 ` Peter Zijlstra 2013-10-09 17:15 ` Ingo Molnar 2013-10-09 17:15 ` Ingo Molnar 2013-10-09 17:18 ` Peter Zijlstra 2013-10-09 17:18 ` Peter Zijlstra 2013-10-24 12:26 ` Automatic NUMA balancing patches for tip-urgent/stable Mel Gorman 2013-10-24 12:26 ` Mel Gorman 2013-10-26 12:11 ` Ingo Molnar 2013-10-26 12:11 ` Ingo Molnar 2013-10-29 9:42 ` Mel Gorman 2013-10-29 9:42 ` Mel Gorman 2013-10-29 9:48 ` Ingo Molnar 2013-10-29 9:48 ` Ingo Molnar 2013-10-29 10:24 ` Mel Gorman 2013-10-29 10:24 ` Mel Gorman 2013-10-29 10:41 ` Ingo Molnar 2013-10-29 10:41 ` Ingo Molnar 2013-10-29 12:48 ` Mel Gorman 2013-10-29 12:48 ` Mel Gorman 2013-10-31 9:51 ` [RFC GIT PULL] NUMA-balancing memory corruption fixes Ingo Molnar 2013-10-31 9:51 ` Ingo Molnar 2013-10-31 22:25 ` Linus Torvalds 2013-10-31 22:25 ` Linus Torvalds 2013-11-01 7:36 ` Ingo Molnar 2013-11-01 7:36 ` Ingo Molnar -- strict thread matches above, loose matches on Subject: below -- 2013-09-27 13:26 [PATCH 0/63] Basic scheduler support for automatic NUMA balancing V8 Mel Gorman 2013-09-27 13:27 ` [PATCH 17/63] sched: Set the scan rate proportional to the memory usage of the task being scanned Mel Gorman 2013-09-27 13:27 ` Mel Gorman
Reply instructions: You may reply publicly to this message via plain-text email using any one of the following methods: * Save the following mbox file, import it into your mail client, and reply-to-all from there: mbox Avoid top-posting and favor interleaved quoting: https://en.wikipedia.org/wiki/Posting_style#Interleaved_style * Reply using the --to, --cc, and --in-reply-to switches of git-send-email(1): git send-email \ --in-reply-to=1381141781-10992-18-git-send-email-mgorman@suse.de \ --to=mgorman@suse.de \ --cc=a.p.zijlstra@chello.nl \ --cc=aarcange@redhat.com \ --cc=hannes@cmpxchg.org \ --cc=linux-kernel@vger.kernel.org \ --cc=linux-mm@kvack.org \ --cc=mingo@kernel.org \ --cc=riel@redhat.com \ --cc=srikar@linux.vnet.ibm.com \ /path/to/YOUR_REPLY https://kernel.org/pub/software/scm/git/docs/git-send-email.html * If your mail client supports setting the In-Reply-To header via mailto: links, try the mailto: linkBe sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes, see mirroring instructions on how to clone and mirror all data and code used by this external index.