All of lore.kernel.org
 help / color / mirror / Atom feed
From: Rob Herring <robherring2@gmail.com>
To: Timo Kokkonen <timo.kokkonen@offcode.fi>
Cc: Alexandre Belloni <alexandre.belloni@free-electrons.com>,
	Guenter Roeck <linux@roeck-us.net>,
	Boris Brezillon <boris.brezillon@free-electrons.com>,
	Nicolas Ferre <nicolas.ferre@atmel.com>,
	linux-watchdog@vger.kernel.org,
	"linux-arm-kernel@lists.infradead.org"
	<linux-arm-kernel@lists.infradead.org>
Subject: Re: [PATCH 2/2] at91sam9_wdt: Allow watchdog to reset device at early boot
Date: Fri, 20 Feb 2015 08:06:23 -0600	[thread overview]
Message-ID: <CAL_Jsq+p=dEE0hz3B+nMG_Y1zNeZiJhfQXzDQ6b8xyKcc2VBJg@mail.gmail.com> (raw)
In-Reply-To: <54E57F4F.5040300@offcode.fi>

On Thu, Feb 19, 2015 at 12:14 AM, Timo Kokkonen
<timo.kokkonen@offcode.fi> wrote:
> Hi,
>
>
> On 18.02.2015 23:11, Rob Herring wrote:
>>
>> On Wed, Feb 18, 2015 at 10:00 AM, Alexandre Belloni
>> <alexandre.belloni@free-electrons.com> wrote:
>>>
>>> Hi,
>>>
>>> On 18/02/2015 at 06:50:44 -0800, Guenter Roeck wrote :
>>>>>>>
>>>>>>>    Optional properties:
>>>>>>>    - timeout-sec: Contains the watchdog timeout in seconds.
>>>>>>> +- early-timeout-sec: If present, specifies a timeout value in
>>>>>>> seconds
>>>>>>> +  that the driver keeps on ticking the watchdog HW on behalf of user
>>>>>>> +  space. Once this timeout expires watchdog is left to expire in
>>>>>>> +  timeout-sec seconds. If this propery is set to zero, watchdog is
>>>>>>> +  started (or left running) so that a reset occurs in timeout-sec
>>>>>>> +  since the watchdog was started.
>>>>>>>
>>>>>>>    Example:
>>>>>>>
>>>>>>>    watchdog {
>>>>>>>             timeout-sec = <60>;
>>>>>>> +   early-timeout-sec = <120>;
>>>>>>
>>>>>>
>>>>>> That is not a generic property as you defined it; if so,
>>>>>> it would have to be implemented in the watchdog core code,
>>>>>> not in the at91 code. You'll have to document it in the bindings
>>>>>> description for at91sam9_wdt.
>>>>>
>>>>>
>>>>> Then, if this is a controller specific property, it should be defined
>>>>> with the 'atmel,' prefix.
>>>>> We're kind of looping here: the initial discussion was "is there a need
>>>>> for this property to be a generic one ?", and now you're saying no,
>>>>> while you previously left the door opened.
>>>>>
>>>>> Tomi is proposing a generic approach, as you asked him to. I agree that
>>>>> parsing the property in core code and making its value part of the
>>>>> generic watchdog struct makes sense (that's what I proposed to Tomi a
>>>>> few weeks ago).
>>>>>
>>>> Hmm ... the problem here is that the property description creates the
>>>> assumption or expectation that the property is used if defined,
>>>> which is not the case.
>>>>
>>>> I am not sure how to best resolve this. Maybe a comment in the property
>>>> description stating that implementation of is device (driver) dependent
>>>> ?
>>>> After all, that is true for the timeout-sec property as well.
>>>>
>>>
>>> I would leave it in the generic file and state that it may not be
>>> implemented in the driver. That way, the property is documented for new
>>> driver writers.
>>
>>
>> That is pretty much true of any optional property. Whether implemented
>> in the driver or core is an implementation detail that does not belong
>> in the binding.
>>
>> I find this property pretty questionable. Certainly having the kernel
>> service a watchdog either enabled at reset, in the bootloader, or by
>> the kernel is a useful feature. A timeout for "how long userspace
>> watchdog daemon takes to start" does not belong in DT. timeout-sec
>> should be the default/initial timeout and long enough for userspace to
>> start. Userspace can then change it to a more suitable value.
>
>
> That would be a good workaround if we had enough time in the watchdog HW to
> wait long enough for the user space to start up. For example in atmel HW the
> maximum is 16 seconds, which may not be enough for the kernel to boot up and
> the user space to start the watchdog daemon.

Well, the 16 sec maximum may be something useful to put into the DT as
that actually is a property of the h/w.

> But even that is not enough as all of the watchdog drivers attempt to
> *disable* the watchdog device before user space opens it. What good is a
> watchdog if it is disabled by the kernel and we got stuck before the daemon
> wakes up and re-enables it? This the problem with all of the watchdog
> drivers right now. There are plenty of products out there that can't deal
> with this kind of limitation. They are all hacking around it one way or
> another. If there is a crash, the watchdog must reset the device. I can't
> think of any other run time way to configure the watchdog for this kind of
> situation than having a device tree property for it.

I fully agree the current design is broken. We should fix that in a
generic way.

> What I am proposing here is a way to solve this without hacking. I was told
> to think also a way to defer starting the watchdog for a given timeout so
> that user space would have more time to wake up, which sounded like a good
> idea. And this obviously needs to be implemented so that the watchdog is
> guaranteed to reset the device should there be a crash of any kind that
> prevents the watchdog daemon from starting up. There are a lot of details
> that need to be taken care of properly and therefore watchdog core can't do
> much about it, which is why I thought there is no much point trying to do it
> in watchdog core.

Deferring would assume that the watchdog is not already enabled.

Putting in how long the kernel should service the watchdog in DT is
like putting soft or hard lockup detection times into DT. These are
kernel settings. If you need to change this, you should update your
kernel or kernel settings, not the DT.

Rob

>
> But never the less I can try to state this in the documentation just to make
> clear what we are trying to solve here.
>
> Thanks for all the good comments!
>
> -Timo

WARNING: multiple messages have this Message-ID (diff)
From: robherring2@gmail.com (Rob Herring)
To: linux-arm-kernel@lists.infradead.org
Subject: [PATCH 2/2] at91sam9_wdt: Allow watchdog to reset device at early boot
Date: Fri, 20 Feb 2015 08:06:23 -0600	[thread overview]
Message-ID: <CAL_Jsq+p=dEE0hz3B+nMG_Y1zNeZiJhfQXzDQ6b8xyKcc2VBJg@mail.gmail.com> (raw)
In-Reply-To: <54E57F4F.5040300@offcode.fi>

On Thu, Feb 19, 2015 at 12:14 AM, Timo Kokkonen
<timo.kokkonen@offcode.fi> wrote:
> Hi,
>
>
> On 18.02.2015 23:11, Rob Herring wrote:
>>
>> On Wed, Feb 18, 2015 at 10:00 AM, Alexandre Belloni
>> <alexandre.belloni@free-electrons.com> wrote:
>>>
>>> Hi,
>>>
>>> On 18/02/2015 at 06:50:44 -0800, Guenter Roeck wrote :
>>>>>>>
>>>>>>>    Optional properties:
>>>>>>>    - timeout-sec: Contains the watchdog timeout in seconds.
>>>>>>> +- early-timeout-sec: If present, specifies a timeout value in
>>>>>>> seconds
>>>>>>> +  that the driver keeps on ticking the watchdog HW on behalf of user
>>>>>>> +  space. Once this timeout expires watchdog is left to expire in
>>>>>>> +  timeout-sec seconds. If this propery is set to zero, watchdog is
>>>>>>> +  started (or left running) so that a reset occurs in timeout-sec
>>>>>>> +  since the watchdog was started.
>>>>>>>
>>>>>>>    Example:
>>>>>>>
>>>>>>>    watchdog {
>>>>>>>             timeout-sec = <60>;
>>>>>>> +   early-timeout-sec = <120>;
>>>>>>
>>>>>>
>>>>>> That is not a generic property as you defined it; if so,
>>>>>> it would have to be implemented in the watchdog core code,
>>>>>> not in the at91 code. You'll have to document it in the bindings
>>>>>> description for at91sam9_wdt.
>>>>>
>>>>>
>>>>> Then, if this is a controller specific property, it should be defined
>>>>> with the 'atmel,' prefix.
>>>>> We're kind of looping here: the initial discussion was "is there a need
>>>>> for this property to be a generic one ?", and now you're saying no,
>>>>> while you previously left the door opened.
>>>>>
>>>>> Tomi is proposing a generic approach, as you asked him to. I agree that
>>>>> parsing the property in core code and making its value part of the
>>>>> generic watchdog struct makes sense (that's what I proposed to Tomi a
>>>>> few weeks ago).
>>>>>
>>>> Hmm ... the problem here is that the property description creates the
>>>> assumption or expectation that the property is used if defined,
>>>> which is not the case.
>>>>
>>>> I am not sure how to best resolve this. Maybe a comment in the property
>>>> description stating that implementation of is device (driver) dependent
>>>> ?
>>>> After all, that is true for the timeout-sec property as well.
>>>>
>>>
>>> I would leave it in the generic file and state that it may not be
>>> implemented in the driver. That way, the property is documented for new
>>> driver writers.
>>
>>
>> That is pretty much true of any optional property. Whether implemented
>> in the driver or core is an implementation detail that does not belong
>> in the binding.
>>
>> I find this property pretty questionable. Certainly having the kernel
>> service a watchdog either enabled at reset, in the bootloader, or by
>> the kernel is a useful feature. A timeout for "how long userspace
>> watchdog daemon takes to start" does not belong in DT. timeout-sec
>> should be the default/initial timeout and long enough for userspace to
>> start. Userspace can then change it to a more suitable value.
>
>
> That would be a good workaround if we had enough time in the watchdog HW to
> wait long enough for the user space to start up. For example in atmel HW the
> maximum is 16 seconds, which may not be enough for the kernel to boot up and
> the user space to start the watchdog daemon.

Well, the 16 sec maximum may be something useful to put into the DT as
that actually is a property of the h/w.

> But even that is not enough as all of the watchdog drivers attempt to
> *disable* the watchdog device before user space opens it. What good is a
> watchdog if it is disabled by the kernel and we got stuck before the daemon
> wakes up and re-enables it? This the problem with all of the watchdog
> drivers right now. There are plenty of products out there that can't deal
> with this kind of limitation. They are all hacking around it one way or
> another. If there is a crash, the watchdog must reset the device. I can't
> think of any other run time way to configure the watchdog for this kind of
> situation than having a device tree property for it.

I fully agree the current design is broken. We should fix that in a
generic way.

> What I am proposing here is a way to solve this without hacking. I was told
> to think also a way to defer starting the watchdog for a given timeout so
> that user space would have more time to wake up, which sounded like a good
> idea. And this obviously needs to be implemented so that the watchdog is
> guaranteed to reset the device should there be a crash of any kind that
> prevents the watchdog daemon from starting up. There are a lot of details
> that need to be taken care of properly and therefore watchdog core can't do
> much about it, which is why I thought there is no much point trying to do it
> in watchdog core.

Deferring would assume that the watchdog is not already enabled.

Putting in how long the kernel should service the watchdog in DT is
like putting soft or hard lockup detection times into DT. These are
kernel settings. If you need to change this, you should update your
kernel or kernel settings, not the DT.

Rob

>
> But never the less I can try to state this in the documentation just to make
> clear what we are trying to solve here.
>
> Thanks for all the good comments!
>
> -Timo

  reply	other threads:[~2015-02-20 14:06 UTC|newest]

Thread overview: 110+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2014-10-23 10:40 [PATCH] at91sam9_wdt: Allow watchdog to reset device at early boot Timo Kokkonen
2014-10-23 10:40 ` Timo Kokkonen
2014-11-12  8:20 ` Timo Kokkonen
2014-11-12  8:20   ` Timo Kokkonen
2014-11-13  9:12   ` Nicolas Ferre
2014-11-13  9:12     ` Nicolas Ferre
2014-11-14  8:40     ` Timo Kokkonen
2014-11-14  8:40       ` Timo Kokkonen
2014-11-21 12:23       ` Timo Kokkonen
2014-11-21 12:23         ` Timo Kokkonen
2014-11-27  6:53         ` Timo Kokkonen
2014-11-27  6:53           ` Timo Kokkonen
2014-11-27  9:22           ` Nicolas Ferre
2014-11-27  9:22             ` Nicolas Ferre
2014-11-27 17:23             ` Guenter Roeck
2014-11-27 17:23               ` Guenter Roeck
2014-11-27 19:06               ` Boris Brezillon
2014-11-27 19:06                 ` Boris Brezillon
2014-11-27 19:31                 ` Guenter Roeck
2014-11-27 19:31                   ` Guenter Roeck
2014-11-28  0:30                   ` Alexandre Belloni
2014-11-28  0:30                     ` Alexandre Belloni
2014-11-28  6:40                   ` Timo Kokkonen
2014-11-28  6:40                     ` Timo Kokkonen
2014-11-27 19:00         ` Boris Brezillon
2014-11-27 19:00           ` Boris Brezillon
2014-11-28  6:42           ` Timo Kokkonen
2014-11-28  6:42             ` Timo Kokkonen
2014-12-05 12:57           ` Timo Kokkonen
2014-12-05 12:57             ` Timo Kokkonen
2014-12-05 14:12             ` Boris Brezillon
2014-12-05 14:12               ` Boris Brezillon
2014-12-05 18:42               ` Timo Kokkonen
2014-12-05 18:42                 ` Timo Kokkonen
2014-12-05 19:02                 ` Guenter Roeck
2014-12-05 19:02                   ` Guenter Roeck
2014-12-05 20:32                   ` Timo Kokkonen
2014-12-05 20:32                     ` Timo Kokkonen
2014-12-05 21:39                     ` Guenter Roeck
2014-12-05 21:39                       ` Guenter Roeck
2014-12-06 10:11                       ` Timo Kokkonen
2014-12-06 10:11                         ` Timo Kokkonen
2015-01-13 14:53                         ` Guenter Roeck
2015-01-13 14:53                           ` Guenter Roeck
2015-01-14  6:09                           ` Timo Kokkonen
2015-01-14  6:09                             ` Timo Kokkonen
2015-02-18 12:57                           ` [PATCHv3 0/2] watchdog: Introduce "early-timeout-sec" property Timo Kokkonen
2015-02-18 12:57                             ` Timo Kokkonen
2015-02-18 12:57                             ` [PATCH 1/2] devicetree: Document generic watchdog properties Timo Kokkonen
2015-02-18 12:57                               ` Timo Kokkonen
2015-02-18 12:57                             ` [PATCH 2/2] at91sam9_wdt: Allow watchdog to reset device at early boot Timo Kokkonen
2015-02-18 12:57                               ` Timo Kokkonen
2015-02-18 13:21                               ` Boris Brezillon
2015-02-18 13:21                                 ` Boris Brezillon
2015-02-18 13:59                               ` Guenter Roeck
2015-02-18 13:59                                 ` Guenter Roeck
2015-02-18 14:17                                 ` Boris Brezillon
2015-02-18 14:17                                   ` Boris Brezillon
2015-02-18 14:50                                   ` Guenter Roeck
2015-02-18 14:50                                     ` Guenter Roeck
2015-02-18 16:00                                     ` Alexandre Belloni
2015-02-18 16:00                                       ` Alexandre Belloni
2015-02-18 17:50                                       ` Guenter Roeck
2015-02-18 17:50                                         ` Guenter Roeck
2015-02-18 20:21                                         ` Boris Brezillon
2015-02-18 20:21                                           ` Boris Brezillon
2015-02-19  6:02                                           ` Timo Kokkonen
2015-02-19  6:02                                             ` Timo Kokkonen
2015-02-18 21:11                                       ` Rob Herring
2015-02-18 21:11                                         ` Rob Herring
2015-02-19  6:14                                         ` Timo Kokkonen
2015-02-19  6:14                                           ` Timo Kokkonen
2015-02-20 14:06                                           ` Rob Herring [this message]
2015-02-20 14:06                                             ` Rob Herring
2015-02-20 16:28                                             ` Guenter Roeck
2015-02-20 16:28                                               ` Guenter Roeck
2015-02-20 19:43                                               ` Boris Brezillon
2015-02-20 19:43                                                 ` Boris Brezillon
2015-02-20 20:04                                                 ` Guenter Roeck
2015-02-20 20:04                                                   ` Guenter Roeck
2015-02-20  7:48                               ` Jean-Christophe PLAGNIOL-VILLARD
2015-02-20  7:48                                 ` Jean-Christophe PLAGNIOL-VILLARD
2015-02-20  7:51                                 ` Boris Brezillon
2015-02-20  7:51                                   ` Boris Brezillon
2015-02-20 16:33                                   ` Jean-Christophe PLAGNIOL-VILLARD
2015-02-20 16:33                                     ` Jean-Christophe PLAGNIOL-VILLARD
2015-02-20 17:16                                     ` Boris Brezillon
2015-02-20 17:16                                       ` Boris Brezillon
2015-02-20 18:06                                       ` Guenter Roeck
2015-02-20 18:06                                         ` Guenter Roeck
2015-02-23  7:29                                         ` Timo Kokkonen
2015-02-23  7:29                                           ` Timo Kokkonen
2015-02-23  8:51                                           ` Boris Brezillon
2015-02-23  8:51                                             ` Boris Brezillon
2015-02-23  9:11                                             ` Timo Kokkonen
2015-02-23  9:11                                               ` Timo Kokkonen
2015-02-23 16:19                                               ` Guenter Roeck
2015-02-23 16:19                                                 ` Guenter Roeck
2015-02-23 17:10                                                 ` Rob Herring
2015-02-23 17:10                                                   ` Rob Herring
2015-02-23 17:43                                                   ` Guenter Roeck
2015-02-23 17:43                                                     ` Guenter Roeck
2015-02-20  8:00                                 ` Timo Kokkonen
2015-02-20  8:00                                   ` Timo Kokkonen
2015-02-20 16:09                                 ` Guenter Roeck
2015-02-20 16:09                                   ` Guenter Roeck
2015-02-18 13:16                             ` [PATCHv3 0/2] watchdog: Introduce "early-timeout-sec" property Boris Brezillon
2015-02-18 13:16                               ` Boris Brezillon
2015-02-18 13:51                               ` Timo Kokkonen
2015-02-18 13:51                                 ` Timo Kokkonen

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAL_Jsq+p=dEE0hz3B+nMG_Y1zNeZiJhfQXzDQ6b8xyKcc2VBJg@mail.gmail.com' \
    --to=robherring2@gmail.com \
    --cc=alexandre.belloni@free-electrons.com \
    --cc=boris.brezillon@free-electrons.com \
    --cc=linux-arm-kernel@lists.infradead.org \
    --cc=linux-watchdog@vger.kernel.org \
    --cc=linux@roeck-us.net \
    --cc=nicolas.ferre@atmel.com \
    --cc=timo.kokkonen@offcode.fi \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.