linux-mm.kvack.org archive mirror
 help / color / mirror / Atom feed
* [PATCH v3] mm: introduce validity check on vm dirtiness settings
@ 2017-09-19 22:43 Yafang Shao
  2017-09-26 10:25 ` Michal Hocko
  0 siblings, 1 reply; 7+ messages in thread
From: Yafang Shao @ 2017-09-19 22:43 UTC (permalink / raw)
  To: jack
  Cc: akpm, hannes, mhocko, vdavydov.dev, jlayton, nborisov, tytso,
	mawilcox, linux-mm, linux-kernel, laoar.shao

we can find the logic in domain_dirty_limits() that
when dirty bg_thresh is bigger than dirty thresh,
bg_thresh will be set as thresh * 1 / 2.
	if (bg_thresh >= thresh)
		bg_thresh = thresh / 2;

But actually we can set vm background dirtiness bigger than
vm dirtiness successfully. This behavior may mislead us.
We'd better do this validity check at the beginning.

Signed-off-by: Yafang Shao <laoar.shao@gmail.com>
---
 Documentation/sysctl/vm.txt |  6 +++
 mm/page-writeback.c         | 92 +++++++++++++++++++++++++++++++++++++++++----
 2 files changed, 90 insertions(+), 8 deletions(-)

diff --git a/Documentation/sysctl/vm.txt b/Documentation/sysctl/vm.txt
index 9baf66a..0bab85d 100644
--- a/Documentation/sysctl/vm.txt
+++ b/Documentation/sysctl/vm.txt
@@ -156,6 +156,9 @@ read.
 Note: the minimum value allowed for dirty_bytes is two pages (in bytes); any
 value lower than this limit will be ignored and the old configuration will be
 retained.
+Note: the value of dirty_bytes also cannot be set lower than
+dirty_background_bytes or the amount of memory corresponding to
+dirty_background_ratio.
 
 ==============================================================
 
@@ -176,6 +179,9 @@ generating disk writes will itself start writing out dirty data.
 
 The total available memory is not equal to total system memory.
 
+Note: dirty_ratio cannot be set lower than dirty_background_ratio or
+ratio corresponding to dirty_background_bytes.
+
 ==============================================================
 
 dirty_writeback_centisecs
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 0b9c5cb..fadb1d7 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -511,15 +511,71 @@ bool node_dirty_ok(struct pglist_data *pgdat)
 	return nr_pages <= limit;
 }
 
+static bool vm_dirty_settings_valid(void)
+{
+	bool ret = true;
+	unsigned long bytes;
+
+	if (vm_dirty_ratio > 0) {
+		if (dirty_background_ratio >= vm_dirty_ratio) {
+			ret = false;
+			goto out;
+		}
+
+		bytes = global_dirtyable_memory() * PAGE_SIZE / 100 *
+				vm_dirty_ratio;
+		if (dirty_background_bytes >= bytes) {
+			ret = false;
+			goto out;
+		}
+	}
+
+	if (vm_dirty_bytes > 0) {
+		if (dirty_background_bytes >= vm_dirty_bytes) {
+			ret = false;
+			goto out;
+		}
+
+		bytes = global_dirtyable_memory() * PAGE_SIZE / 100 *
+				dirty_background_ratio;
+
+		if (bytes >= vm_dirty_bytes) {
+			ret = false;
+			goto out;
+		}
+	}
+
+	if ((vm_dirty_bytes == 0 && vm_dirty_ratio == 0) ||
+		(dirty_background_bytes == 0 && dirty_background_ratio == 0))
+		ret = false;
+
+out:
+	return ret;
+}
+
 int dirty_background_ratio_handler(struct ctl_table *table, int write,
 		void __user *buffer, size_t *lenp,
 		loff_t *ppos)
 {
 	int ret;
+	int old_ratio = dirty_background_ratio;
 
 	ret = proc_dointvec_minmax(table, write, buffer, lenp, ppos);
-	if (ret == 0 && write)
-		dirty_background_bytes = 0;
+
+	/* When dirty_background_ratio is 0 and dirty_background_bytes isn't 0,
+	 * it's not correct to set dirty_background_bytes to 0 if we reset
+	 * dirty_background_ratio to 0.
+	 * So do nothing if the new ratio is not different.
+	 */
+	if (ret == 0 && write && dirty_background_ratio != old_ratio) {
+		if (vm_dirty_settings_valid())
+			dirty_background_bytes = 0;
+		else {
+			dirty_background_ratio = old_ratio;
+			ret = -EINVAL;
+		}
+	}
+
 	return ret;
 }
 
@@ -528,10 +584,20 @@ int dirty_background_bytes_handler(struct ctl_table *table, int write,
 		loff_t *ppos)
 {
 	int ret;
+	unsigned long old_bytes = dirty_background_bytes;
 
 	ret = proc_doulongvec_minmax(table, write, buffer, lenp, ppos);
-	if (ret == 0 && write)
-		dirty_background_ratio = 0;
+
+	/* the reson is same as above */
+	if (ret == 0 && write && dirty_background_bytes != old_bytes) {
+		if (vm_dirty_settings_valid())
+			dirty_background_ratio = 0;
+		else {
+			dirty_background_bytes = old_bytes;
+			ret = -EINVAL;
+		}
+	}
+
 	return ret;
 }
 
@@ -544,8 +610,13 @@ int dirty_ratio_handler(struct ctl_table *table, int write,
 
 	ret = proc_dointvec_minmax(table, write, buffer, lenp, ppos);
 	if (ret == 0 && write && vm_dirty_ratio != old_ratio) {
-		writeback_set_ratelimit();
-		vm_dirty_bytes = 0;
+		if (vm_dirty_settings_valid()) {
+			writeback_set_ratelimit();
+			vm_dirty_bytes = 0;
+		} else {
+			vm_dirty_ratio = old_ratio;
+			ret = -EINVAL;
+		}
 	}
 	return ret;
 }
@@ -559,8 +630,13 @@ int dirty_bytes_handler(struct ctl_table *table, int write,
 
 	ret = proc_doulongvec_minmax(table, write, buffer, lenp, ppos);
 	if (ret == 0 && write && vm_dirty_bytes != old_bytes) {
-		writeback_set_ratelimit();
-		vm_dirty_ratio = 0;
+		if (vm_dirty_settings_valid()) {
+			writeback_set_ratelimit();
+			vm_dirty_ratio = 0;
+		} else {
+			vm_dirty_bytes = old_bytes;
+			ret = -EINVAL;
+		}
 	}
 	return ret;
 }
-- 
1.8.3.1

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply related	[flat|nested] 7+ messages in thread

* Re: [PATCH v3] mm: introduce validity check on vm dirtiness settings
  2017-09-19 22:43 [PATCH v3] mm: introduce validity check on vm dirtiness settings Yafang Shao
@ 2017-09-26 10:25 ` Michal Hocko
  2017-09-26 11:06   ` Yafang Shao
  0 siblings, 1 reply; 7+ messages in thread
From: Michal Hocko @ 2017-09-26 10:25 UTC (permalink / raw)
  To: Yafang Shao
  Cc: jack, akpm, hannes, vdavydov.dev, jlayton, nborisov, tytso,
	mawilcox, linux-mm, linux-kernel

On Wed 20-09-17 06:43:35, Yafang Shao wrote:
> we can find the logic in domain_dirty_limits() that
> when dirty bg_thresh is bigger than dirty thresh,
> bg_thresh will be set as thresh * 1 / 2.
> 	if (bg_thresh >= thresh)
> 		bg_thresh = thresh / 2;
> 
> But actually we can set vm background dirtiness bigger than
> vm dirtiness successfully. This behavior may mislead us.
> We'd better do this validity check at the beginning.

This is an admin only interface. You can screw setting this up even
when you keep consistency between the background and direct limits. In
general we do not try to be clever for these knobs because we _expect_
admins to do sane things. Why is this any different and why do we need
to add quite some code to handle one particular corner case?

To be honest I am not entirely sure this is worth the code and the
future maintenance burden.

> Signed-off-by: Yafang Shao <laoar.shao@gmail.com>
> ---
>  Documentation/sysctl/vm.txt |  6 +++
>  mm/page-writeback.c         | 92 +++++++++++++++++++++++++++++++++++++++++----
>  2 files changed, 90 insertions(+), 8 deletions(-)
> 
> diff --git a/Documentation/sysctl/vm.txt b/Documentation/sysctl/vm.txt
> index 9baf66a..0bab85d 100644
> --- a/Documentation/sysctl/vm.txt
> +++ b/Documentation/sysctl/vm.txt
> @@ -156,6 +156,9 @@ read.
>  Note: the minimum value allowed for dirty_bytes is two pages (in bytes); any
>  value lower than this limit will be ignored and the old configuration will be
>  retained.
> +Note: the value of dirty_bytes also cannot be set lower than
> +dirty_background_bytes or the amount of memory corresponding to
> +dirty_background_ratio.
>  
>  ==============================================================
>  
> @@ -176,6 +179,9 @@ generating disk writes will itself start writing out dirty data.
>  
>  The total available memory is not equal to total system memory.
>  
> +Note: dirty_ratio cannot be set lower than dirty_background_ratio or
> +ratio corresponding to dirty_background_bytes.
> +
>  ==============================================================
>  
>  dirty_writeback_centisecs
> diff --git a/mm/page-writeback.c b/mm/page-writeback.c
> index 0b9c5cb..fadb1d7 100644
> --- a/mm/page-writeback.c
> +++ b/mm/page-writeback.c
> @@ -511,15 +511,71 @@ bool node_dirty_ok(struct pglist_data *pgdat)
>  	return nr_pages <= limit;
>  }
>  
> +static bool vm_dirty_settings_valid(void)
> +{
> +	bool ret = true;
> +	unsigned long bytes;
> +
> +	if (vm_dirty_ratio > 0) {
> +		if (dirty_background_ratio >= vm_dirty_ratio) {
> +			ret = false;
> +			goto out;
> +		}
> +
> +		bytes = global_dirtyable_memory() * PAGE_SIZE / 100 *
> +				vm_dirty_ratio;
> +		if (dirty_background_bytes >= bytes) {
> +			ret = false;
> +			goto out;
> +		}
> +	}
> +
> +	if (vm_dirty_bytes > 0) {
> +		if (dirty_background_bytes >= vm_dirty_bytes) {
> +			ret = false;
> +			goto out;
> +		}
> +
> +		bytes = global_dirtyable_memory() * PAGE_SIZE / 100 *
> +				dirty_background_ratio;
> +
> +		if (bytes >= vm_dirty_bytes) {
> +			ret = false;
> +			goto out;
> +		}
> +	}
> +
> +	if ((vm_dirty_bytes == 0 && vm_dirty_ratio == 0) ||
> +		(dirty_background_bytes == 0 && dirty_background_ratio == 0))
> +		ret = false;
> +
> +out:
> +	return ret;
> +}
> +
>  int dirty_background_ratio_handler(struct ctl_table *table, int write,
>  		void __user *buffer, size_t *lenp,
>  		loff_t *ppos)
>  {
>  	int ret;
> +	int old_ratio = dirty_background_ratio;
>  
>  	ret = proc_dointvec_minmax(table, write, buffer, lenp, ppos);
> -	if (ret == 0 && write)
> -		dirty_background_bytes = 0;
> +
> +	/* When dirty_background_ratio is 0 and dirty_background_bytes isn't 0,
> +	 * it's not correct to set dirty_background_bytes to 0 if we reset
> +	 * dirty_background_ratio to 0.
> +	 * So do nothing if the new ratio is not different.
> +	 */
> +	if (ret == 0 && write && dirty_background_ratio != old_ratio) {
> +		if (vm_dirty_settings_valid())
> +			dirty_background_bytes = 0;
> +		else {
> +			dirty_background_ratio = old_ratio;
> +			ret = -EINVAL;
> +		}
> +	}
> +
>  	return ret;
>  }
>  
> @@ -528,10 +584,20 @@ int dirty_background_bytes_handler(struct ctl_table *table, int write,
>  		loff_t *ppos)
>  {
>  	int ret;
> +	unsigned long old_bytes = dirty_background_bytes;
>  
>  	ret = proc_doulongvec_minmax(table, write, buffer, lenp, ppos);
> -	if (ret == 0 && write)
> -		dirty_background_ratio = 0;
> +
> +	/* the reson is same as above */
> +	if (ret == 0 && write && dirty_background_bytes != old_bytes) {
> +		if (vm_dirty_settings_valid())
> +			dirty_background_ratio = 0;
> +		else {
> +			dirty_background_bytes = old_bytes;
> +			ret = -EINVAL;
> +		}
> +	}
> +
>  	return ret;
>  }
>  
> @@ -544,8 +610,13 @@ int dirty_ratio_handler(struct ctl_table *table, int write,
>  
>  	ret = proc_dointvec_minmax(table, write, buffer, lenp, ppos);
>  	if (ret == 0 && write && vm_dirty_ratio != old_ratio) {
> -		writeback_set_ratelimit();
> -		vm_dirty_bytes = 0;
> +		if (vm_dirty_settings_valid()) {
> +			writeback_set_ratelimit();
> +			vm_dirty_bytes = 0;
> +		} else {
> +			vm_dirty_ratio = old_ratio;
> +			ret = -EINVAL;
> +		}
>  	}
>  	return ret;
>  }
> @@ -559,8 +630,13 @@ int dirty_bytes_handler(struct ctl_table *table, int write,
>  
>  	ret = proc_doulongvec_minmax(table, write, buffer, lenp, ppos);
>  	if (ret == 0 && write && vm_dirty_bytes != old_bytes) {
> -		writeback_set_ratelimit();
> -		vm_dirty_ratio = 0;
> +		if (vm_dirty_settings_valid()) {
> +			writeback_set_ratelimit();
> +			vm_dirty_ratio = 0;
> +		} else {
> +			vm_dirty_bytes = old_bytes;
> +			ret = -EINVAL;
> +		}
>  	}
>  	return ret;
>  }
> -- 
> 1.8.3.1
> 

-- 
Michal Hocko
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v3] mm: introduce validity check on vm dirtiness settings
  2017-09-26 10:25 ` Michal Hocko
@ 2017-09-26 11:06   ` Yafang Shao
  2017-09-26 11:26     ` Michal Hocko
  0 siblings, 1 reply; 7+ messages in thread
From: Yafang Shao @ 2017-09-26 11:06 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Jan Kara, akpm, Johannes Weiner, vdavydov.dev, jlayton, nborisov,
	Theodore Ts'o, mawilcox, linux-mm, linux-kernel

2017-09-26 18:25 GMT+08:00 Michal Hocko <mhocko@kernel.org>:
> On Wed 20-09-17 06:43:35, Yafang Shao wrote:
>> we can find the logic in domain_dirty_limits() that
>> when dirty bg_thresh is bigger than dirty thresh,
>> bg_thresh will be set as thresh * 1 / 2.
>>       if (bg_thresh >= thresh)
>>               bg_thresh = thresh / 2;
>>
>> But actually we can set vm background dirtiness bigger than
>> vm dirtiness successfully. This behavior may mislead us.
>> We'd better do this validity check at the beginning.
>
> This is an admin only interface. You can screw setting this up even
> when you keep consistency between the background and direct limits. In
> general we do not try to be clever for these knobs because we _expect_
> admins to do sane things. Why is this any different and why do we need
> to add quite some code to handle one particular corner case?
>

Of course we expect admins to do the sane things, but not all admins
are expert or faimilar with linux kernel source code.
If we have to read the source code to know what is the right thing to
do, I don't think this is a good interface, even for the admin.
Anyway, there's no document on that direct limits should not less than
background limits.


> To be honest I am not entirely sure this is worth the code and the
> future maintenance burden.
I'm not sure if this code is a burden for the future maintenance, but
I think that if we don't introduce this code it is a burden to the
admins.
BTW, the newest code is [patch v4].


>
>> Signed-off-by: Yafang Shao <laoar.shao@gmail.com>
>> ---
>>  Documentation/sysctl/vm.txt |  6 +++
>>  mm/page-writeback.c         | 92 +++++++++++++++++++++++++++++++++++++++++----
>>  2 files changed, 90 insertions(+), 8 deletions(-)
>>
>> diff --git a/Documentation/sysctl/vm.txt b/Documentation/sysctl/vm.txt
>> index 9baf66a..0bab85d 100644
>> --- a/Documentation/sysctl/vm.txt
>> +++ b/Documentation/sysctl/vm.txt
>> @@ -156,6 +156,9 @@ read.
>>  Note: the minimum value allowed for dirty_bytes is two pages (in bytes); any
>>  value lower than this limit will be ignored and the old configuration will be
>>  retained.
>> +Note: the value of dirty_bytes also cannot be set lower than
>> +dirty_background_bytes or the amount of memory corresponding to
>> +dirty_background_ratio.
>>
>>  ==============================================================
>>
>> @@ -176,6 +179,9 @@ generating disk writes will itself start writing out dirty data.
>>
>>  The total available memory is not equal to total system memory.
>>
>> +Note: dirty_ratio cannot be set lower than dirty_background_ratio or
>> +ratio corresponding to dirty_background_bytes.
>> +
>>  ==============================================================
>>
>>  dirty_writeback_centisecs
>> diff --git a/mm/page-writeback.c b/mm/page-writeback.c
>> index 0b9c5cb..fadb1d7 100644
>> --- a/mm/page-writeback.c
>> +++ b/mm/page-writeback.c
>> @@ -511,15 +511,71 @@ bool node_dirty_ok(struct pglist_data *pgdat)
>>       return nr_pages <= limit;
>>  }
>>
>> +static bool vm_dirty_settings_valid(void)
>> +{
>> +     bool ret = true;
>> +     unsigned long bytes;
>> +
>> +     if (vm_dirty_ratio > 0) {
>> +             if (dirty_background_ratio >= vm_dirty_ratio) {
>> +                     ret = false;
>> +                     goto out;
>> +             }
>> +
>> +             bytes = global_dirtyable_memory() * PAGE_SIZE / 100 *
>> +                             vm_dirty_ratio;
>> +             if (dirty_background_bytes >= bytes) {
>> +                     ret = false;
>> +                     goto out;
>> +             }
>> +     }
>> +
>> +     if (vm_dirty_bytes > 0) {
>> +             if (dirty_background_bytes >= vm_dirty_bytes) {
>> +                     ret = false;
>> +                     goto out;
>> +             }
>> +
>> +             bytes = global_dirtyable_memory() * PAGE_SIZE / 100 *
>> +                             dirty_background_ratio;
>> +
>> +             if (bytes >= vm_dirty_bytes) {
>> +                     ret = false;
>> +                     goto out;
>> +             }
>> +     }
>> +
>> +     if ((vm_dirty_bytes == 0 && vm_dirty_ratio == 0) ||
>> +             (dirty_background_bytes == 0 && dirty_background_ratio == 0))
>> +             ret = false;
>> +
>> +out:
>> +     return ret;
>> +}
>> +
>>  int dirty_background_ratio_handler(struct ctl_table *table, int write,
>>               void __user *buffer, size_t *lenp,
>>               loff_t *ppos)
>>  {
>>       int ret;
>> +     int old_ratio = dirty_background_ratio;
>>
>>       ret = proc_dointvec_minmax(table, write, buffer, lenp, ppos);
>> -     if (ret == 0 && write)
>> -             dirty_background_bytes = 0;
>> +
>> +     /* When dirty_background_ratio is 0 and dirty_background_bytes isn't 0,
>> +      * it's not correct to set dirty_background_bytes to 0 if we reset
>> +      * dirty_background_ratio to 0.
>> +      * So do nothing if the new ratio is not different.
>> +      */
>> +     if (ret == 0 && write && dirty_background_ratio != old_ratio) {
>> +             if (vm_dirty_settings_valid())
>> +                     dirty_background_bytes = 0;
>> +             else {
>> +                     dirty_background_ratio = old_ratio;
>> +                     ret = -EINVAL;
>> +             }
>> +     }
>> +
>>       return ret;
>>  }
>>
>> @@ -528,10 +584,20 @@ int dirty_background_bytes_handler(struct ctl_table *table, int write,
>>               loff_t *ppos)
>>  {
>>       int ret;
>> +     unsigned long old_bytes = dirty_background_bytes;
>>
>>       ret = proc_doulongvec_minmax(table, write, buffer, lenp, ppos);
>> -     if (ret == 0 && write)
>> -             dirty_background_ratio = 0;
>> +
>> +     /* the reson is same as above */
>> +     if (ret == 0 && write && dirty_background_bytes != old_bytes) {
>> +             if (vm_dirty_settings_valid())
>> +                     dirty_background_ratio = 0;
>> +             else {
>> +                     dirty_background_bytes = old_bytes;
>> +                     ret = -EINVAL;
>> +             }
>> +     }
>> +
>>       return ret;
>>  }
>>
>> @@ -544,8 +610,13 @@ int dirty_ratio_handler(struct ctl_table *table, int write,
>>
>>       ret = proc_dointvec_minmax(table, write, buffer, lenp, ppos);
>>       if (ret == 0 && write && vm_dirty_ratio != old_ratio) {
>> -             writeback_set_ratelimit();
>> -             vm_dirty_bytes = 0;
>> +             if (vm_dirty_settings_valid()) {
>> +                     writeback_set_ratelimit();
>> +                     vm_dirty_bytes = 0;
>> +             } else {
>> +                     vm_dirty_ratio = old_ratio;
>> +                     ret = -EINVAL;
>> +             }
>>       }
>>       return ret;
>>  }
>> @@ -559,8 +630,13 @@ int dirty_bytes_handler(struct ctl_table *table, int write,
>>
>>       ret = proc_doulongvec_minmax(table, write, buffer, lenp, ppos);
>>       if (ret == 0 && write && vm_dirty_bytes != old_bytes) {
>> -             writeback_set_ratelimit();
>> -             vm_dirty_ratio = 0;
>> +             if (vm_dirty_settings_valid()) {
>> +                     writeback_set_ratelimit();
>> +                     vm_dirty_ratio = 0;
>> +             } else {
>> +                     vm_dirty_bytes = old_bytes;
>> +                     ret = -EINVAL;
>> +             }
>>       }
>>       return ret;
>>  }
>> --
>> 1.8.3.1
>>
>
> --
> Michal Hocko
> SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v3] mm: introduce validity check on vm dirtiness settings
  2017-09-26 11:06   ` Yafang Shao
@ 2017-09-26 11:26     ` Michal Hocko
  2017-09-26 11:45       ` Yafang Shao
  0 siblings, 1 reply; 7+ messages in thread
From: Michal Hocko @ 2017-09-26 11:26 UTC (permalink / raw)
  To: Yafang Shao
  Cc: Jan Kara, akpm, Johannes Weiner, vdavydov.dev, jlayton, nborisov,
	Theodore Ts'o, mawilcox, linux-mm, linux-kernel

On Tue 26-09-17 19:06:37, Yafang Shao wrote:
> 2017-09-26 18:25 GMT+08:00 Michal Hocko <mhocko@kernel.org>:
> > On Wed 20-09-17 06:43:35, Yafang Shao wrote:
> >> we can find the logic in domain_dirty_limits() that
> >> when dirty bg_thresh is bigger than dirty thresh,
> >> bg_thresh will be set as thresh * 1 / 2.
> >>       if (bg_thresh >= thresh)
> >>               bg_thresh = thresh / 2;
> >>
> >> But actually we can set vm background dirtiness bigger than
> >> vm dirtiness successfully. This behavior may mislead us.
> >> We'd better do this validity check at the beginning.
> >
> > This is an admin only interface. You can screw setting this up even
> > when you keep consistency between the background and direct limits. In
> > general we do not try to be clever for these knobs because we _expect_
> > admins to do sane things. Why is this any different and why do we need
> > to add quite some code to handle one particular corner case?
> >
> 
> Of course we expect admins to do the sane things, but not all admins
> are expert or faimilar with linux kernel source code.
> If we have to read the source code to know what is the right thing to
> do, I don't think this is a good interface, even for the admin.

Well, it is kind of natural to setup background below the direct limit
in general so I am not sure what is so surprising here. Moreover setting
a non default drity limits already requires some expertise. It is not
like an arbitrary value will work just fine...

> Anyway, there's no document on that direct limits should not less than
> background limits.

Then improve the documentation.

> > To be honest I am not entirely sure this is worth the code and the
> > future maintenance burden.
> I'm not sure if this code is a burden for the future maintenance, but
> I think that if we don't introduce this code it is a burden to the
> admins.

anytime we might need to tweak background vs direct limit we would have
to change these checks as well and that sounds like a maint. burden to
me.
-- 
Michal Hocko
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v3] mm: introduce validity check on vm dirtiness settings
  2017-09-26 11:26     ` Michal Hocko
@ 2017-09-26 11:45       ` Yafang Shao
  2017-09-26 11:54         ` Michal Hocko
  0 siblings, 1 reply; 7+ messages in thread
From: Yafang Shao @ 2017-09-26 11:45 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Jan Kara, akpm, Johannes Weiner, vdavydov.dev, jlayton, nborisov,
	Theodore Ts'o, mawilcox, linux-mm, linux-kernel

2017-09-26 19:26 GMT+08:00 Michal Hocko <mhocko@kernel.org>:
> On Tue 26-09-17 19:06:37, Yafang Shao wrote:
>> 2017-09-26 18:25 GMT+08:00 Michal Hocko <mhocko@kernel.org>:
>> > On Wed 20-09-17 06:43:35, Yafang Shao wrote:
>> >> we can find the logic in domain_dirty_limits() that
>> >> when dirty bg_thresh is bigger than dirty thresh,
>> >> bg_thresh will be set as thresh * 1 / 2.
>> >>       if (bg_thresh >= thresh)
>> >>               bg_thresh = thresh / 2;
>> >>
>> >> But actually we can set vm background dirtiness bigger than
>> >> vm dirtiness successfully. This behavior may mislead us.
>> >> We'd better do this validity check at the beginning.
>> >
>> > This is an admin only interface. You can screw setting this up even
>> > when you keep consistency between the background and direct limits. In
>> > general we do not try to be clever for these knobs because we _expect_
>> > admins to do sane things. Why is this any different and why do we need
>> > to add quite some code to handle one particular corner case?
>> >
>>
>> Of course we expect admins to do the sane things, but not all admins
>> are expert or faimilar with linux kernel source code.
>> If we have to read the source code to know what is the right thing to
>> do, I don't think this is a good interface, even for the admin.
>
> Well, it is kind of natural to setup background below the direct limit
> in general so I am not sure what is so surprising here. Moreover setting
> a non default drity limits already requires some expertise. It is not
> like an arbitrary value will work just fine...
>
>> Anyway, there's no document on that direct limits should not less than
>> background limits.
>
> Then improve the documentation.

I have improved the kernel documentation as well, in order to make it
more clear for the newbies.

>> > To be honest I am not entirely sure this is worth the code and the
>> > future maintenance burden.
>> I'm not sure if this code is a burden for the future maintenance, but
>> I think that if we don't introduce this code it is a burden to the
>> admins.
>
> anytime we might need to tweak background vs direct limit we would have
> to change these checks as well and that sounds like a maint. burden to
> me.

Would pls. show me some example ?

> --
> Michal Hocko
> SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v3] mm: introduce validity check on vm dirtiness settings
  2017-09-26 11:45       ` Yafang Shao
@ 2017-09-26 11:54         ` Michal Hocko
  2017-09-26 13:33           ` Jan Kara
  0 siblings, 1 reply; 7+ messages in thread
From: Michal Hocko @ 2017-09-26 11:54 UTC (permalink / raw)
  To: Yafang Shao
  Cc: Jan Kara, akpm, Johannes Weiner, vdavydov.dev, jlayton, nborisov,
	Theodore Ts'o, mawilcox, linux-mm, linux-kernel

On Tue 26-09-17 19:45:45, Yafang Shao wrote:
> 2017-09-26 19:26 GMT+08:00 Michal Hocko <mhocko@kernel.org>:
> > On Tue 26-09-17 19:06:37, Yafang Shao wrote:
[...]
> >> Anyway, there's no document on that direct limits should not less than
> >> background limits.
> >
> > Then improve the documentation.
> 
> I have improved the kernel documentation as well, in order to make it
> more clear for the newbies.

Why do we need to update the code then?

> >> > To be honest I am not entirely sure this is worth the code and the
> >> > future maintenance burden.
> >> I'm not sure if this code is a burden for the future maintenance, but
> >> I think that if we don't introduce this code it is a burden to the
> >> admins.
> >
> > anytime we might need to tweak background vs direct limit we would have
> > to change these checks as well and that sounds like a maint. burden to
> > me.
> 
> Would pls. show me some example ?

What kind of examples would you like to see. I meant that if the current
logic of bacground vs. direct limit changes the code to check it which
is at a different place IIRC would have to be kept in sync.

That being said, this is my personal opinion, I will not object if there
is a general consensus on merging this. I just believe that this is not
simply worth adding a single line of code. You can then a lot of harm by
setting different values which would pass the added check.
-- 
Michal Hocko
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: [PATCH v3] mm: introduce validity check on vm dirtiness settings
  2017-09-26 11:54         ` Michal Hocko
@ 2017-09-26 13:33           ` Jan Kara
  0 siblings, 0 replies; 7+ messages in thread
From: Jan Kara @ 2017-09-26 13:33 UTC (permalink / raw)
  To: Michal Hocko
  Cc: Yafang Shao, Jan Kara, akpm, Johannes Weiner, vdavydov.dev,
	jlayton, nborisov, Theodore Ts'o, mawilcox, linux-mm,
	linux-kernel

On Tue 26-09-17 13:54:23, Michal Hocko wrote:
> On Tue 26-09-17 19:45:45, Yafang Shao wrote:
> > >> > To be honest I am not entirely sure this is worth the code and the
> > >> > future maintenance burden.
> > >> I'm not sure if this code is a burden for the future maintenance, but
> > >> I think that if we don't introduce this code it is a burden to the
> > >> admins.
> > >
> > > anytime we might need to tweak background vs direct limit we would have
> > > to change these checks as well and that sounds like a maint. burden to
> > > me.
> > 
> > Would pls. show me some example ?
> 
> What kind of examples would you like to see. I meant that if the current
> logic of bacground vs. direct limit changes the code to check it which
> is at a different place IIRC would have to be kept in sync.
> 
> That being said, this is my personal opinion, I will not object if there
> is a general consensus on merging this. I just believe that this is not
> simply worth adding a single line of code. You can then a lot of harm by
> setting different values which would pass the added check.

So I personally think that the checks Yafang added are worth the extra
code. The situation with ratio/bytes interface and hard/background limit is
complex enough that it makes sense to have basic sanity checks to me. That
being said I don't have too strong opinion on this so just documentation
update would be also fine by me.

								Honza
-- 
Jan Kara <jack@suse.com>
SUSE Labs, CR

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2017-09-26 13:33 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2017-09-19 22:43 [PATCH v3] mm: introduce validity check on vm dirtiness settings Yafang Shao
2017-09-26 10:25 ` Michal Hocko
2017-09-26 11:06   ` Yafang Shao
2017-09-26 11:26     ` Michal Hocko
2017-09-26 11:45       ` Yafang Shao
2017-09-26 11:54         ` Michal Hocko
2017-09-26 13:33           ` Jan Kara

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).