linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs
@ 2022-10-11  1:00 Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 01/14] mm: add bdi_set_strict_limit() function Stefan Roesch
                   ` (15 more replies)
  0 siblings, 16 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

At meta network block devices (nbd) are used to implement remote block storage. In testing
and during production it has been observed that these network block devices can consume
a huge portion of the dirty writeback and writeback can take a considerable time.

To give stricter limits, I'm proposing the following changes and new sysfs knobs:

1) strictlimit knob
  Currently the max_ratio knob exists to limit the dirty_memory. However this knob
  only applies once (dirty_ratio + dirty_background_ratio) / 2 has been reached.
  With the BDI_CAP_STRICTLIMIT flag, the max_ratio can be applied without reaching
  that limit. This change exposes that knob.

  This knob can also be useful for NFS, fuse filesystems and USB devices.

2) Part of 10000 internal calculation
  The max_ratio is based on percentage. With the current machine sizes percentage
  values can be very high (1% of a 256GB main memory is already 2.5GB). This change
  uses part of 10000 instead of percentages for the internal calculations.

3) Introduce two new knobs: min_bytes and max_bytes.
  Currently all calculations are based on ratio, but for a user it often more
  convenient to specify a limit in bytes. The new knobs will not store bytes values,
  instead they will translate the byte value to a corresponding ratio. As the internal
  values are now part of 10000, the ratio is closer to the specified value. However
  the value should be more seen as an approximation as it can fluctuate over time.


Stefan Roesch (14):
  mm: add bdi_set_strict_limit() function
  mm: Add new knob /sys/class/bdi/<bdi>/strict_limit
  mm: document new /sys/class/bdi/<bdi>/strict_limit knob
  mm: Use part per 10000 for bdi ratios.
  mm: add bdi_get_max_bytes() function
  mm: split off __bdi_set_max_ratio() function
  mm: add bdi_set_max_bytes() function.
  mm: Add new knob /sys/class/bdi/<bdi>/max_bytes
  mm: document new /sys/class/bdi/<bdi>/max_bytes knob
  mm: add bdi_get_min_bytes() function.
  mm: split off __bdi_set_min_ratio() function
  mm: add bdi_set_min_bytes() function
  mm: add new /sys/class/bdi/<bdi>/min_bytes knob
  mm: document new /sys/class/bdi/<bdi>/min_bytes knob

 Documentation/ABI/testing/sysfs-class-bdi |  40 +++++++
 include/linux/backing-dev.h               |   8 ++
 mm/backing-dev.c                          |  93 +++++++++++++++-
 mm/page-writeback.c                       | 126 ++++++++++++++++++++--
 4 files changed, 253 insertions(+), 14 deletions(-)


base-commit: e2302539dd4f1c62d96651c07ddb05aa2461d29c
-- 
2.30.2



^ permalink raw reply	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 01/14] mm: add bdi_set_strict_limit() function
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 02/14] mm: Add new knob /sys/class/bdi/<bdi>/strict_limit Stefan Roesch
                   ` (14 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This adds the bdi_set_strict_limit function to be able to set/unset the
BDI_CAP_STRICTLIMIT flag.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 include/linux/backing-dev.h |  1 +
 mm/page-writeback.c         | 16 ++++++++++++++++
 2 files changed, 17 insertions(+)

diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index 439815cc1ab9..9c984ffc8a0a 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -104,6 +104,7 @@ static inline unsigned long wb_stat_error(void)
 
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio);
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio);
+int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit);
 
 /*
  * Flags in backing_dev_info::capability
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 7e9d8d857ecc..e22aae0ecacd 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -698,6 +698,22 @@ int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned max_ratio)
 }
 EXPORT_SYMBOL(bdi_set_max_ratio);
 
+int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit)
+{
+	if (strict_limit > 1)
+		return -EINVAL;
+
+	spin_lock_bh(&bdi_lock);
+	if (strict_limit)
+		bdi->capabilities |= BDI_CAP_STRICTLIMIT;
+	else
+		bdi->capabilities &= ~BDI_CAP_STRICTLIMIT;
+	spin_unlock_bh(&bdi_lock);
+
+	return 0;
+}
+EXPORT_SYMBOL_GPL(bdi_set_strict_limit);
+
 static unsigned long dirty_freerun_ceiling(unsigned long thresh,
 					   unsigned long bg_thresh)
 {
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 02/14] mm: Add new knob /sys/class/bdi/<bdi>/strict_limit
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 01/14] mm: add bdi_set_strict_limit() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 03/14] mm: document new /sys/class/bdi/<bdi>/strict_limit knob Stefan Roesch
                   ` (13 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

Add a new knob to /sys/class/bdi/<bdi>/strict_limit. This new knob
allows to set/unset the flag BDI_CAP_STRICTLIMIT in the bdi
capabilities.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 mm/backing-dev.c | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

diff --git a/mm/backing-dev.c b/mm/backing-dev.c
index de65cb1e5f76..f9aaa14ad98f 100644
--- a/mm/backing-dev.c
+++ b/mm/backing-dev.c
@@ -209,11 +209,40 @@ static ssize_t stable_pages_required_show(struct device *dev,
 }
 static DEVICE_ATTR_RO(stable_pages_required);
 
+static ssize_t strict_limit_store(struct device *dev,
+		struct device_attribute *attr, const char *buf, size_t count)
+{
+	struct backing_dev_info *bdi = dev_get_drvdata(dev);
+	unsigned int strict_limit;
+	ssize_t ret;
+
+	ret = kstrtouint(buf, 10, &strict_limit);
+	if (ret < 0)
+		return ret;
+
+	ret = bdi_set_strict_limit(bdi, strict_limit);
+	if (!ret)
+		ret = count;
+
+	return ret;
+}
+
+static ssize_t strict_limit_show(struct device *dev,
+		struct device_attribute *attr, char *buf)
+{
+	struct backing_dev_info *bdi = dev_get_drvdata(dev);
+
+	return sysfs_emit(buf, "%d\n",
+			!!(bdi->capabilities & BDI_CAP_STRICTLIMIT));
+}
+static DEVICE_ATTR_RW(strict_limit);
+
 static struct attribute *bdi_dev_attrs[] = {
 	&dev_attr_read_ahead_kb.attr,
 	&dev_attr_min_ratio.attr,
 	&dev_attr_max_ratio.attr,
 	&dev_attr_stable_pages_required.attr,
+	&dev_attr_strict_limit.attr,
 	NULL,
 };
 ATTRIBUTE_GROUPS(bdi_dev);
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 03/14] mm: document new /sys/class/bdi/<bdi>/strict_limit knob
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 01/14] mm: add bdi_set_strict_limit() function Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 02/14] mm: Add new knob /sys/class/bdi/<bdi>/strict_limit Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 04/14] mm: Use part per 10000 for bdi ratios Stefan Roesch
                   ` (12 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This documents the new /sys/class/bdi/<bdi>/strict_limit knob.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 Documentation/ABI/testing/sysfs-class-bdi | 11 +++++++++++
 1 file changed, 11 insertions(+)

diff --git a/Documentation/ABI/testing/sysfs-class-bdi b/Documentation/ABI/testing/sysfs-class-bdi
index 6d2a2fc189dd..68b5d4018c2f 100644
--- a/Documentation/ABI/testing/sysfs-class-bdi
+++ b/Documentation/ABI/testing/sysfs-class-bdi
@@ -55,6 +55,17 @@ Description:
 	mount that is prone to get stuck, or a FUSE mount which cannot
 	be trusted to play fair.
 
+	(read-write)
+What:		/sys/class/bdi/<bdi>/strict_limit
+Date:		October 2022
+Contact:	Stefan Roesch <shr@devkernel.io>
+Description:
+	Forces per-BDI checks for the share of given device in the write-back
+	cache even before the global background dirty limit is reached. This
+	is useful in situations where the global limit is much higher than
+	affordable for given relatively slow (or untrusted) device. Turning
+	strictlimit on has no visible effect if max_ratio is equal to 100%.
+
 	(read-write)
 What:		/sys/class/bdi/<bdi>/stable_pages_required
 Date:		January 2008
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 04/14] mm: Use part per 10000 for bdi ratios.
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (2 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 03/14] mm: document new /sys/class/bdi/<bdi>/strict_limit knob Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 05/14] mm: add bdi_get_max_bytes() function Stefan Roesch
                   ` (11 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

To get finer granularity for ratio calculations use part per 10000
instead of percentiles. This is especially important if we want to
automatically convert byte values to ratios. Otherwise the values that
are actually used can be quite different. This is also important for
machines with more main memory (1% of 256GB is already 2.5GB).

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 include/linux/backing-dev.h |  3 +++
 mm/backing-dev.c            |  6 +++---
 mm/page-writeback.c         | 15 +++++++++------
 3 files changed, 15 insertions(+), 9 deletions(-)

diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index 9c984ffc8a0a..f698befa76a0 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -102,6 +102,9 @@ static inline unsigned long wb_stat_error(void)
 #endif
 }
 
+/* BDI ratio is expressed as part per 10000 for finer granularity. */
+#define BDI_RATIO_SCALE 100
+
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio);
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio);
 int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit);
diff --git a/mm/backing-dev.c b/mm/backing-dev.c
index f9aaa14ad98f..e64bc49561b1 100644
--- a/mm/backing-dev.c
+++ b/mm/backing-dev.c
@@ -178,7 +178,7 @@ static ssize_t min_ratio_store(struct device *dev,
 
 	return ret;
 }
-BDI_SHOW(min_ratio, bdi->min_ratio)
+BDI_SHOW(min_ratio, bdi->min_ratio / BDI_RATIO_SCALE)
 
 static ssize_t max_ratio_store(struct device *dev,
 		struct device_attribute *attr, const char *buf, size_t count)
@@ -197,7 +197,7 @@ static ssize_t max_ratio_store(struct device *dev,
 
 	return ret;
 }
-BDI_SHOW(max_ratio, bdi->max_ratio)
+BDI_SHOW(max_ratio, bdi->max_ratio / BDI_RATIO_SCALE)
 
 static ssize_t stable_pages_required_show(struct device *dev,
 					  struct device_attribute *attr,
@@ -811,7 +811,7 @@ int bdi_init(struct backing_dev_info *bdi)
 
 	kref_init(&bdi->refcnt);
 	bdi->min_ratio = 0;
-	bdi->max_ratio = 100;
+	bdi->max_ratio = 100 * BDI_RATIO_SCALE;
 	bdi->max_prop_frac = FPROP_FRAC_BASE;
 	INIT_LIST_HEAD(&bdi->bdi_list);
 	INIT_LIST_HEAD(&bdi->wb_list);
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index e22aae0ecacd..4d5383d4da45 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -197,7 +197,7 @@ static void wb_min_max_ratio(struct bdi_writeback *wb,
 			min *= this_bw;
 			min = div64_ul(min, tot_bw);
 		}
-		if (max < 100) {
+		if (max < 100 * BDI_RATIO_SCALE) {
 			max *= this_bw;
 			max = div64_ul(max, tot_bw);
 		}
@@ -655,6 +655,8 @@ int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
 	unsigned int delta;
 	int ret = 0;
 
+	min_ratio *= BDI_RATIO_SCALE;
+
 	spin_lock_bh(&bdi_lock);
 	if (min_ratio > bdi->max_ratio) {
 		ret = -EINVAL;
@@ -665,7 +667,7 @@ int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
 			bdi->min_ratio = min_ratio;
 		} else {
 			delta = min_ratio - bdi->min_ratio;
-			if (bdi_min_ratio + delta < 100) {
+			if (bdi_min_ratio + delta < 100 * BDI_RATIO_SCALE) {
 				bdi_min_ratio += delta;
 				bdi->min_ratio = min_ratio;
 			} else {
@@ -684,6 +686,7 @@ int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned max_ratio)
 
 	if (max_ratio > 100)
 		return -EINVAL;
+	max_ratio *= BDI_RATIO_SCALE;
 
 	spin_lock_bh(&bdi_lock);
 	if (bdi->min_ratio > max_ratio) {
@@ -776,15 +779,15 @@ static unsigned long __wb_calc_thresh(struct dirty_throttle_control *dtc)
 	fprop_fraction_percpu(&dom->completions, dtc->wb_completions,
 			      &numerator, &denominator);
 
-	wb_thresh = (thresh * (100 - bdi_min_ratio)) / 100;
+	wb_thresh = (thresh * (100 * BDI_RATIO_SCALE - bdi_min_ratio)) / (100 * BDI_RATIO_SCALE);
 	wb_thresh *= numerator;
 	wb_thresh = div64_ul(wb_thresh, denominator);
 
 	wb_min_max_ratio(dtc->wb, &wb_min_ratio, &wb_max_ratio);
 
-	wb_thresh += (thresh * wb_min_ratio) / 100;
-	if (wb_thresh > (thresh * wb_max_ratio) / 100)
-		wb_thresh = thresh * wb_max_ratio / 100;
+	wb_thresh += (thresh * wb_min_ratio) / (100 * BDI_RATIO_SCALE);
+	if (wb_thresh > (thresh * wb_max_ratio) / (100 * BDI_RATIO_SCALE))
+		wb_thresh = thresh * wb_max_ratio / (100 * BDI_RATIO_SCALE);
 
 	return wb_thresh;
 }
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 05/14] mm: add bdi_get_max_bytes() function
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (3 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 04/14] mm: Use part per 10000 for bdi ratios Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 06/14] mm: split off __bdi_set_max_ratio() function Stefan Roesch
                   ` (10 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This adds a function to return the specified value for max_bytes. It
converts the stored max_ratio of the bdi to the corresponding bytes
value. It introduces the bdi_get_bytes helper function to do the
conversion. This is an approximation as it is based on the value that is
returned by global_dirty_limits(), which can change. The helper function
will also be used by the min_bytes bdi knob.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 include/linux/backing-dev.h |  1 +
 mm/page-writeback.c         | 18 ++++++++++++++++++
 2 files changed, 19 insertions(+)

diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index f698befa76a0..94fedf66c915 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -105,6 +105,7 @@ static inline unsigned long wb_stat_error(void)
 /* BDI ratio is expressed as part per 10000 for finer granularity. */
 #define BDI_RATIO_SCALE 100
 
+unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi);
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio);
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio);
 int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit);
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 4d5383d4da45..0b9dcf5afda2 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -650,6 +650,18 @@ void wb_domain_exit(struct wb_domain *dom)
  */
 static unsigned int bdi_min_ratio;
 
+static unsigned long long bdi_get_bytes(unsigned int ratio)
+{
+	unsigned long background_thresh;
+	unsigned long dirty_thresh;
+	unsigned long long bytes;
+
+	global_dirty_limits(&background_thresh, &dirty_thresh);
+	bytes = (dirty_thresh * PAGE_SIZE * ratio) / BDI_RATIO_SCALE / 100;
+
+	return bytes;
+}
+
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
 {
 	unsigned int delta;
@@ -701,6 +713,12 @@ int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned max_ratio)
 }
 EXPORT_SYMBOL(bdi_set_max_ratio);
 
+unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi)
+{
+	return bdi_get_bytes(bdi->max_ratio);
+}
+EXPORT_SYMBOL_GPL(bdi_get_max_bytes);
+
 int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit)
 {
 	if (strict_limit > 1)
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 06/14] mm: split off __bdi_set_max_ratio() function
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (4 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 05/14] mm: add bdi_get_max_bytes() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 07/14] mm: add bdi_set_max_bytes() function Stefan Roesch
                   ` (9 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This splits off the __bdi_set_max_ratio() function from the
bdi_set_max_ratio() function. The __bdi_set_max_ratio() function will
also be called from the bdi_set_max_bytes() function, which will be
introduced in the next patch.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 mm/page-writeback.c | 14 +++++++++-----
 1 file changed, 9 insertions(+), 5 deletions(-)

diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 0b9dcf5afda2..8b8936603783 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -692,14 +692,10 @@ int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
 	return ret;
 }
 
-int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned max_ratio)
+static int __bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio)
 {
 	int ret = 0;
 
-	if (max_ratio > 100)
-		return -EINVAL;
-	max_ratio *= BDI_RATIO_SCALE;
-
 	spin_lock_bh(&bdi_lock);
 	if (bdi->min_ratio > max_ratio) {
 		ret = -EINVAL;
@@ -711,6 +707,14 @@ int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned max_ratio)
 
 	return ret;
 }
+
+int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio)
+{
+	if (max_ratio > 100)
+		return -EINVAL;
+
+	return __bdi_set_max_ratio(bdi, max_ratio * BDI_RATIO_SCALE);
+}
 EXPORT_SYMBOL(bdi_set_max_ratio);
 
 unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi)
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 07/14] mm: add bdi_set_max_bytes() function.
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (5 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 06/14] mm: split off __bdi_set_max_ratio() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 08/14] mm: Add new knob /sys/class/bdi/<bdi>/max_bytes Stefan Roesch
                   ` (8 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This introduces the bdi_set_max_bytes() function. The max_bytes function
does not store the max_bytes value. Instead it converts the max_bytes
value into the corresponding ratio value.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 include/linux/backing-dev.h |  1 +
 mm/page-writeback.c         | 37 +++++++++++++++++++++++++++++++++++++
 2 files changed, 38 insertions(+)

diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index 94fedf66c915..a10598f90d4d 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -108,6 +108,7 @@ static inline unsigned long wb_stat_error(void)
 unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi);
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio);
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio);
+int bdi_set_max_bytes(struct backing_dev_info *bdi, unsigned long long max_bytes);
 int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit);
 
 /*
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 8b8936603783..a6594ebdcbd8 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -650,6 +650,28 @@ void wb_domain_exit(struct wb_domain *dom)
  */
 static unsigned int bdi_min_ratio;
 
+static int bdi_check_pages_limit(unsigned long pages)
+{
+	unsigned long max_dirty_pages = global_dirtyable_memory();
+
+	if (pages > max_dirty_pages / 2)
+		return -EINVAL;
+
+	return 0;
+}
+
+static unsigned long bdi_ratio_from_pages(unsigned long long pages)
+{
+	unsigned long background_thresh;
+	unsigned long dirty_thresh;
+	unsigned long ratio;
+
+	global_dirty_limits(&background_thresh, &dirty_thresh);
+	ratio = (pages * 100 * BDI_RATIO_SCALE) / dirty_thresh;
+
+	return ratio;
+}
+
 static unsigned long long bdi_get_bytes(unsigned int ratio)
 {
 	unsigned long background_thresh;
@@ -723,6 +745,21 @@ unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi)
 }
 EXPORT_SYMBOL_GPL(bdi_get_max_bytes);
 
+int bdi_set_max_bytes(struct backing_dev_info *bdi, unsigned long long max_bytes)
+{
+	int ret;
+	unsigned long pages = max_bytes >> PAGE_SHIFT;
+	unsigned long max_ratio;
+
+	ret = bdi_check_pages_limit(pages);
+	if (ret)
+		return ret;
+
+	max_ratio = bdi_ratio_from_pages(pages);
+	return __bdi_set_max_ratio(bdi, max_ratio);
+}
+EXPORT_SYMBOL_GPL(bdi_set_max_bytes);
+
 int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit)
 {
 	if (strict_limit > 1)
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 08/14] mm: Add new knob /sys/class/bdi/<bdi>/max_bytes
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (6 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 07/14] mm: add bdi_set_max_bytes() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 09/14] mm: document new /sys/class/bdi/<bdi>/max_bytes knob Stefan Roesch
                   ` (7 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This adds the new knob max_bytes to specify a dirty memory limit for the
corresponding bdi. The specified bytes value is converted to a ratio.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 mm/backing-dev.c | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

diff --git a/mm/backing-dev.c b/mm/backing-dev.c
index e64bc49561b1..a3cad145b219 100644
--- a/mm/backing-dev.c
+++ b/mm/backing-dev.c
@@ -199,6 +199,34 @@ static ssize_t max_ratio_store(struct device *dev,
 }
 BDI_SHOW(max_ratio, bdi->max_ratio / BDI_RATIO_SCALE)
 
+static ssize_t max_bytes_show(struct device *dev,
+			      struct device_attribute *attr,
+			      char *buf)
+{
+	struct backing_dev_info *bdi = dev_get_drvdata(dev);
+
+	return sysfs_emit(buf, "%llu\n", bdi_get_max_bytes(bdi));
+}
+
+static ssize_t max_bytes_store(struct device *dev,
+		struct device_attribute *attr, const char *buf, size_t count)
+{
+	struct backing_dev_info *bdi = dev_get_drvdata(dev);
+	unsigned long long bytes;
+	ssize_t ret;
+
+	ret = kstrtoull(buf, 10, &bytes);
+	if (ret < 0)
+		return ret;
+
+	ret = bdi_set_max_bytes(bdi, bytes);
+	if (!ret)
+		ret = count;
+
+	return ret;
+}
+DEVICE_ATTR_RW(max_bytes);
+
 static ssize_t stable_pages_required_show(struct device *dev,
 					  struct device_attribute *attr,
 					  char *buf)
@@ -241,6 +269,7 @@ static struct attribute *bdi_dev_attrs[] = {
 	&dev_attr_read_ahead_kb.attr,
 	&dev_attr_min_ratio.attr,
 	&dev_attr_max_ratio.attr,
+	&dev_attr_max_bytes.attr,
 	&dev_attr_stable_pages_required.attr,
 	&dev_attr_strict_limit.attr,
 	NULL,
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 09/14] mm: document new /sys/class/bdi/<bdi>/max_bytes knob
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (7 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 08/14] mm: Add new knob /sys/class/bdi/<bdi>/max_bytes Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 10/14] mm: add bdi_get_min_bytes() function Stefan Roesch
                   ` (6 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This documents the new /sys/class/bdi/<bdi>/max_bytes knob.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 Documentation/ABI/testing/sysfs-class-bdi | 14 ++++++++++++++
 1 file changed, 14 insertions(+)

diff --git a/Documentation/ABI/testing/sysfs-class-bdi b/Documentation/ABI/testing/sysfs-class-bdi
index 68b5d4018c2f..580f723de049 100644
--- a/Documentation/ABI/testing/sysfs-class-bdi
+++ b/Documentation/ABI/testing/sysfs-class-bdi
@@ -56,6 +56,20 @@ Description:
 	be trusted to play fair.
 
 	(read-write)
+
+What:		/sys/class/bdi/<bdi>/max_bytes
+Date:		October 2022
+Contact:	Stefan Roesch <shr@devkernel.io>
+Description:
+	Allows limiting a particular device to use not more than the
+	given 'max_bytes' of the write-back cache.  This is useful in
+	situations where we want to avoid one device taking all or
+	most of the write-back cache.  For example in case of an NFS
+	mount that is prone to get stuck, a FUSE mount which cannot be
+	trusted to play fair, or a nbd device.
+
+	(read-write)
+
 What:		/sys/class/bdi/<bdi>/strict_limit
 Date:		October 2022
 Contact:	Stefan Roesch <shr@devkernel.io>
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 10/14] mm: add bdi_get_min_bytes() function.
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (8 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 09/14] mm: document new /sys/class/bdi/<bdi>/max_bytes knob Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 11/14] mm: split off __bdi_set_min_ratio() function Stefan Roesch
                   ` (5 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This adds a function to return the specified value for min_bytes. It
converts the stored min_ratio of the bdi to the corresponding bytes
value. This is an approximation as it is based on the value that is
returned by global_dirty_limits(), which can change. The returned
value can be different than the value when the min_bytes value was set.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 include/linux/backing-dev.h | 1 +
 mm/page-writeback.c         | 6 ++++++
 2 files changed, 7 insertions(+)

diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index a10598f90d4d..cdb131b57fcb 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -105,6 +105,7 @@ static inline unsigned long wb_stat_error(void)
 /* BDI ratio is expressed as part per 10000 for finer granularity. */
 #define BDI_RATIO_SCALE 100
 
+unsigned long long bdi_get_min_bytes(struct backing_dev_info *bdi);
 unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi);
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio);
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio);
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index a6594ebdcbd8..a37a25994ad8 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -739,6 +739,12 @@ int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio)
 }
 EXPORT_SYMBOL(bdi_set_max_ratio);
 
+unsigned long long bdi_get_min_bytes(struct backing_dev_info *bdi)
+{
+	return bdi_get_bytes(bdi->min_ratio);
+}
+EXPORT_SYMBOL_GPL(bdi_get_min_bytes);
+
 unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi)
 {
 	return bdi_get_bytes(bdi->max_ratio);
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 11/14] mm: split off __bdi_set_min_ratio() function
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (9 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 10/14] mm: add bdi_get_min_bytes() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 12/14] mm: add bdi_set_min_bytes() function Stefan Roesch
                   ` (4 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This splits off the __bdi_set_min_ratio() function from the
bdi_set_min_ratio() function. The __bdi_set_min_ratio() function will
also be called from the bdi_set_min_bytes() function, which will be
introduced in the next patch.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 mm/page-writeback.c | 7 ++++++-
 1 file changed, 6 insertions(+), 1 deletion(-)

diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index a37a25994ad8..7733dcf96d7e 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -684,7 +684,7 @@ static unsigned long long bdi_get_bytes(unsigned int ratio)
 	return bytes;
 }
 
-int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
+static int __bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
 {
 	unsigned int delta;
 	int ret = 0;
@@ -730,6 +730,11 @@ static int __bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ra
 	return ret;
 }
 
+int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio)
+{
+	return __bdi_set_min_ratio(bdi, min_ratio * BDI_RATIO_SCALE);
+}
+
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio)
 {
 	if (max_ratio > 100)
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 12/14] mm: add bdi_set_min_bytes() function
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (10 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 11/14] mm: split off __bdi_set_min_ratio() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 13/14] mm: add new /sys/class/bdi/<bdi>/min_bytes knob Stefan Roesch
                   ` (3 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This introduces the bdi_set_min_bytes() function. The min_bytes function
does not store the min_bytes value. Instead it converts the min_bytes
value into the corresponding ratio value.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 include/linux/backing-dev.h |  1 +
 mm/page-writeback.c         | 15 +++++++++++++++
 2 files changed, 16 insertions(+)

diff --git a/include/linux/backing-dev.h b/include/linux/backing-dev.h
index cdb131b57fcb..8db058f331bc 100644
--- a/include/linux/backing-dev.h
+++ b/include/linux/backing-dev.h
@@ -109,6 +109,7 @@ unsigned long long bdi_get_min_bytes(struct backing_dev_info *bdi);
 unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi);
 int bdi_set_min_ratio(struct backing_dev_info *bdi, unsigned int min_ratio);
 int bdi_set_max_ratio(struct backing_dev_info *bdi, unsigned int max_ratio);
+int bdi_set_min_bytes(struct backing_dev_info *bdi, unsigned long long min_bytes);
 int bdi_set_max_bytes(struct backing_dev_info *bdi, unsigned long long max_bytes);
 int bdi_set_strict_limit(struct backing_dev_info *bdi, unsigned int strict_limit);
 
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 7733dcf96d7e..04c4a142098b 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -750,6 +750,21 @@ unsigned long long bdi_get_min_bytes(struct backing_dev_info *bdi)
 }
 EXPORT_SYMBOL_GPL(bdi_get_min_bytes);
 
+int bdi_set_min_bytes(struct backing_dev_info *bdi, unsigned long long min_bytes)
+{
+	int ret;
+	unsigned long pages = min_bytes >> PAGE_SHIFT;
+	unsigned long min_ratio;
+
+	ret = bdi_check_pages_limit(pages);
+	if (ret)
+		return ret;
+
+	min_ratio = bdi_ratio_from_pages(pages);
+	return __bdi_set_min_ratio(bdi, min_ratio);
+}
+EXPORT_SYMBOL_GPL(bdi_set_min_bytes);
+
 unsigned long long bdi_get_max_bytes(struct backing_dev_info *bdi)
 {
 	return bdi_get_bytes(bdi->max_ratio);
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 13/14] mm: add new /sys/class/bdi/<bdi>/min_bytes knob
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (11 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 12/14] mm: add bdi_set_min_bytes() function Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11  1:00 ` [RFC PATCH v1 14/14] mm: document " Stefan Roesch
                   ` (2 subsequent siblings)
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

bdi has two existing knobs to limit the amount of dirty memory:
min_ratio and max_ratio. However the granularity of the knobs is limited
and often it is more convenient to specify limits in terms of bytes.
This change adds the min_bytes knob.

It does not store the min_bytes value, instead it converts the max_bytes
value to a ratio. The value is therefore more an approximation than an
absolute value.

It also maintains the sum over all the bdi min_ratio values stored in
the variable bdi_min_ratio.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 mm/backing-dev.c | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

diff --git a/mm/backing-dev.c b/mm/backing-dev.c
index a3cad145b219..9962a2d92fe3 100644
--- a/mm/backing-dev.c
+++ b/mm/backing-dev.c
@@ -199,6 +199,34 @@ static ssize_t max_ratio_store(struct device *dev,
 }
 BDI_SHOW(max_ratio, bdi->max_ratio / BDI_RATIO_SCALE)
 
+static ssize_t min_bytes_show(struct device *dev,
+			      struct device_attribute *attr,
+			      char *buf)
+{
+	struct backing_dev_info *bdi = dev_get_drvdata(dev);
+
+	return sysfs_emit(buf, "%llu\n", bdi_get_min_bytes(bdi));
+}
+
+static ssize_t min_bytes_store(struct device *dev,
+		struct device_attribute *attr, const char *buf, size_t count)
+{
+	struct backing_dev_info *bdi = dev_get_drvdata(dev);
+	unsigned long long bytes;
+	ssize_t ret;
+
+	ret = kstrtoull(buf, 10, &bytes);
+	if (ret < 0)
+		return ret;
+
+	ret = bdi_set_min_bytes(bdi, bytes);
+	if (!ret)
+		ret = count;
+
+	return ret;
+}
+DEVICE_ATTR_RW(min_bytes);
+
 static ssize_t max_bytes_show(struct device *dev,
 			      struct device_attribute *attr,
 			      char *buf)
@@ -269,6 +297,7 @@ static struct attribute *bdi_dev_attrs[] = {
 	&dev_attr_read_ahead_kb.attr,
 	&dev_attr_min_ratio.attr,
 	&dev_attr_max_ratio.attr,
+	&dev_attr_min_bytes.attr,
 	&dev_attr_max_bytes.attr,
 	&dev_attr_stable_pages_required.attr,
 	&dev_attr_strict_limit.attr,
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* [RFC PATCH v1 14/14] mm: document new /sys/class/bdi/<bdi>/min_bytes knob
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (12 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 13/14] mm: add new /sys/class/bdi/<bdi>/min_bytes knob Stefan Roesch
@ 2022-10-11  1:00 ` Stefan Roesch
  2022-10-11 18:20 ` [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Matthew Wilcox
  2022-10-17  7:45 ` Christoph Hellwig
  15 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-11  1:00 UTC (permalink / raw)
  To: kernel-team, linux-block, linux-mm; +Cc: shr, axboe, clm

This documents the new /sys/class/bdi/<bdi>/min_bytes knob.

Signed-off-by: Stefan Roesch <shr@devkernel.io>
---
 Documentation/ABI/testing/sysfs-class-bdi | 15 +++++++++++++++
 1 file changed, 15 insertions(+)

diff --git a/Documentation/ABI/testing/sysfs-class-bdi b/Documentation/ABI/testing/sysfs-class-bdi
index 580f723de049..bec996e29565 100644
--- a/Documentation/ABI/testing/sysfs-class-bdi
+++ b/Documentation/ABI/testing/sysfs-class-bdi
@@ -57,6 +57,21 @@ Description:
 
 	(read-write)
 
+What:		/sys/class/bdi/<bdi>/min_bytes
+Date:		October 2022
+Contact:	Stefan Roesch <shr@devkernel.io>
+Description:
+	Under normal circumstances each device is given a part of the
+	total write-back cache that relates to its current average
+	writeout speed in relation to the other devices.
+
+	The 'min_bytes' parameter allows assigning a minimum
+	percentage of the write-back cache to a particular device
+    expressed in bytes.
+	For example, this is useful for providing a minimum QoS.
+
+	(read-write)
+
 What:		/sys/class/bdi/<bdi>/max_bytes
 Date:		October 2022
 Contact:	Stefan Roesch <shr@devkernel.io>
-- 
2.30.2



^ permalink raw reply related	[flat|nested] 18+ messages in thread

* Re: [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (13 preceding siblings ...)
  2022-10-11  1:00 ` [RFC PATCH v1 14/14] mm: document " Stefan Roesch
@ 2022-10-11 18:20 ` Matthew Wilcox
  2022-10-13 20:11   ` Stefan Roesch
  2022-10-17  7:45 ` Christoph Hellwig
  15 siblings, 1 reply; 18+ messages in thread
From: Matthew Wilcox @ 2022-10-11 18:20 UTC (permalink / raw)
  To: Stefan Roesch; +Cc: kernel-team, linux-block, linux-mm, axboe, clm

On Mon, Oct 10, 2022 at 06:00:30PM -0700, Stefan Roesch wrote:
> 2) Part of 10000 internal calculation
>   The max_ratio is based on percentage. With the current machine sizes percentage
>   values can be very high (1% of a 256GB main memory is already 2.5GB). This change
>   uses part of 10000 instead of percentages for the internal calculations.

Why 10,000?  If you need better accuracy than 1/1000, the next step
should normally be parts per million.


^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs
  2022-10-11 18:20 ` [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Matthew Wilcox
@ 2022-10-13 20:11   ` Stefan Roesch
  0 siblings, 0 replies; 18+ messages in thread
From: Stefan Roesch @ 2022-10-13 20:11 UTC (permalink / raw)
  To: Matthew Wilcox; +Cc: kernel-team, linux-block, linux-mm, axboe, clm


Matthew Wilcox <willy@infradead.org> writes:

> On Mon, Oct 10, 2022 at 06:00:30PM -0700, Stefan Roesch wrote:
>> 2) Part of 10000 internal calculation
>>   The max_ratio is based on percentage. With the current machine sizes percentage
>>   values can be very high (1% of a 256GB main memory is already 2.5GB). This change
>>   uses part of 10000 instead of percentages for the internal calculations.
>
> Why 10,000?  If you need better accuracy than 1/1000, the next step
> should normally be parts per million.

For current main memory sizes 1000 is enough. I wanted to give some
additional headroom. Parts per million is too much. In the next version
of the patch, I'll change it to parts per 1000.


^ permalink raw reply	[flat|nested] 18+ messages in thread

* Re: [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs
  2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
                   ` (14 preceding siblings ...)
  2022-10-11 18:20 ` [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Matthew Wilcox
@ 2022-10-17  7:45 ` Christoph Hellwig
  15 siblings, 0 replies; 18+ messages in thread
From: Christoph Hellwig @ 2022-10-17  7:45 UTC (permalink / raw)
  To: Stefan Roesch; +Cc: kernel-team, linux-block, linux-mm, axboe, clm

On Mon, Oct 10, 2022 at 06:00:30PM -0700, Stefan Roesch wrote:
> At meta network block devices (nbd) are used to implement remote block storage. In testing

FYI, this mail is pretty much misformatted and really hard to read.


^ permalink raw reply	[flat|nested] 18+ messages in thread

end of thread, other threads:[~2022-10-17  7:45 UTC | newest]

Thread overview: 18+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2022-10-11  1:00 [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 01/14] mm: add bdi_set_strict_limit() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 02/14] mm: Add new knob /sys/class/bdi/<bdi>/strict_limit Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 03/14] mm: document new /sys/class/bdi/<bdi>/strict_limit knob Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 04/14] mm: Use part per 10000 for bdi ratios Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 05/14] mm: add bdi_get_max_bytes() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 06/14] mm: split off __bdi_set_max_ratio() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 07/14] mm: add bdi_set_max_bytes() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 08/14] mm: Add new knob /sys/class/bdi/<bdi>/max_bytes Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 09/14] mm: document new /sys/class/bdi/<bdi>/max_bytes knob Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 10/14] mm: add bdi_get_min_bytes() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 11/14] mm: split off __bdi_set_min_ratio() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 12/14] mm: add bdi_set_min_bytes() function Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 13/14] mm: add new /sys/class/bdi/<bdi>/min_bytes knob Stefan Roesch
2022-10-11  1:00 ` [RFC PATCH v1 14/14] mm: document " Stefan Roesch
2022-10-11 18:20 ` [RFC PATCH v1 00/14] mm/block: add bdi sysfs knobs Matthew Wilcox
2022-10-13 20:11   ` Stefan Roesch
2022-10-17  7:45 ` Christoph Hellwig

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).