From: Dmitry Osipenko <digetx@gmail.com>
To: Jon Hunter <jonathanh@nvidia.com>,
	Jean Delvare <jdelvare@suse.com>,
	Guenter Roeck <linux@roeck-us.net>,
	Matt Merhar <mattmerhar@protonmail.com>
Cc: linux-kernel@vger.kernel.org, linux-hwmon@vger.kernel.org,
	linux-tegra@vger.kernel.org
Subject: Re: [PATCH v3 2/4] hwmon: (lm90) Use hwmon_notify_event()
Date: Mon, 21 Feb 2022 15:59:06 +0300
Message-ID: <4c12cab4-6265-78c0-46ce-d4014c9919fe@gmail.com>
In-Reply-To: <13b07bb3-90e4-a501-469c-ce64bc90bfd5@nvidia.com>

21.02.2022 15:56, Jon Hunter wrote:
> 
> On 21/02/2022 12:36, Dmitry Osipenko wrote:
>> 21.02.2022 15:01, Jon Hunter wrote:
>>> Hi Dmitry,
>>>
>>> On 18/06/2021 22:54, Dmitry Osipenko wrote:
>>>> Use hwmon_notify_event() to notify userspace and thermal core about
>>>> temperature changes.
>>>>
>>>> Suggested-by: Guenter Roeck <linux@roeck-us.net>
>>>> Signed-off-by: Dmitry Osipenko <digetx@gmail.com>
>>>> ---
>>>>    drivers/hwmon/lm90.c | 44 +++++++++++++++++++++++++++++++++-----------
>>>>    1 file changed, 33 insertions(+), 11 deletions(-)
>>>>
>>>> diff --git a/drivers/hwmon/lm90.c b/drivers/hwmon/lm90.c
>>>> index 2e057fad05b4..e7b678a40b39 100644
>>>> --- a/drivers/hwmon/lm90.c
>>>> +++ b/drivers/hwmon/lm90.c
>>>> @@ -465,6 +465,7 @@ enum lm90_temp11_reg_index {
>>>>      struct lm90_data {
>>>>        struct i2c_client *client;
>>>> +    struct device *hwmon_dev;
>>>>        u32 channel_config[4];
>>>>        struct hwmon_channel_info temp_info;
>>>>        const struct hwmon_channel_info *info[3];
>>>> @@ -1731,22 +1732,41 @@ static bool lm90_is_tripped(struct i2c_client *client, u16 *status)
>>>>          if ((st & (LM90_STATUS_LLOW | LM90_STATUS_LHIGH | LM90_STATUS_LTHRM)) ||
>>>>            (st2 & MAX6696_STATUS2_LOT2))
>>>> -        dev_warn(&client->dev,
>>>> -             "temp%d out of range, please check!\n", 1);
>>>> +        dev_dbg(&client->dev,
>>>> +            "temp%d out of range, please check!\n", 1);
>>>>        if ((st & (LM90_STATUS_RLOW | LM90_STATUS_RHIGH | LM90_STATUS_RTHRM)) ||
>>>>            (st2 & MAX6696_STATUS2_ROT2))
>>>> -        dev_warn(&client->dev,
>>>> -             "temp%d out of range, please check!\n", 2);
>>>> +        dev_dbg(&client->dev,
>>>> +            "temp%d out of range, please check!\n", 2);
>>>>        if (st & LM90_STATUS_ROPEN)
>>>> -        dev_warn(&client->dev,
>>>> -             "temp%d diode open, please check!\n", 2);
>>>> +        dev_dbg(&client->dev,
>>>> +            "temp%d diode open, please check!\n", 2);
>>>>        if (st2 & (MAX6696_STATUS2_R2LOW | MAX6696_STATUS2_R2HIGH |
>>>>               MAX6696_STATUS2_R2THRM | MAX6696_STATUS2_R2OT2))
>>>> -        dev_warn(&client->dev,
>>>> -             "temp%d out of range, please check!\n", 3);
>>>> +        dev_dbg(&client->dev,
>>>> +            "temp%d out of range, please check!\n", 3);
>>>>        if (st2 & MAX6696_STATUS2_R2OPEN)
>>>> -        dev_warn(&client->dev,
>>>> -             "temp%d diode open, please check!\n", 3);
>>>> +        dev_dbg(&client->dev,
>>>> +            "temp%d diode open, please check!\n", 3);
>>>> +
>>>> +    if (st & LM90_STATUS_LLOW)
>>>> +        hwmon_notify_event(data->hwmon_dev, hwmon_temp,
>>>> +                   hwmon_temp_min, 0);
>>>> +    if (st & LM90_STATUS_RLOW)
>>>> +        hwmon_notify_event(data->hwmon_dev, hwmon_temp,
>>>> +                   hwmon_temp_min, 1);
>>>> +    if (st2 & MAX6696_STATUS2_R2LOW)
>>>> +        hwmon_notify_event(data->hwmon_dev, hwmon_temp,
>>>> +                   hwmon_temp_min, 2);
>>>> +    if (st & LM90_STATUS_LHIGH)
>>>> +        hwmon_notify_event(data->hwmon_dev, hwmon_temp,
>>>> +                   hwmon_temp_max, 0);
>>>> +    if (st & LM90_STATUS_RHIGH)
>>>> +        hwmon_notify_event(data->hwmon_dev, hwmon_temp,
>>>> +                   hwmon_temp_max, 1);
>>>> +    if (st2 & MAX6696_STATUS2_R2HIGH)
>>>> +        hwmon_notify_event(data->hwmon_dev, hwmon_temp,
>>>> +                   hwmon_temp_max, 2);
>>>
>>>
>>> We observed a random null pointer dereference crash somewhere in the
>>> thermal core (crash log below is not very helpful) when calling
>>> mutex_lock(). It looks like we get an interrupt when this crash
>>> happens.
>>>
>>> Looking at the lm90 driver, per the above, I now see we are calling
>>> hwmon_notify_event() from the lm90 interrupt handler. Looking at
>>> hwmon_notify_event() I see that ...
>>>
>>> hwmon_notify_event()
>>>    --> hwmon_thermal_notify()
>>>      --> thermal_zone_device_update()
>>>        --> update_temperature()
>>>          --> mutex_lock()
>>>
>>> So although I don't completely understand the crash, it does seem
>>> that we should not be calling hwmon_notify_event() from the
>>> interrupt handler.
>>>
>>> BTW I have not reproduced this myself yet, so I have just been
>>> reviewing the code to try and understand this.
>>
>> Matt Merhar was experiencing a similar issue on T30 Ouya, but I never
>> managed to reproduce it on Nexus 7 and Acer A500 tablets, and couldn't
>> spot any problem in the code. IIRC, it was a NULL dereference of another
>> pointer within that code.
> 
> 
> OK. From looking at the above I don't think we can call
> hwmon_notify_event() from an interrupt handler because this is going to
> try and request a mutex. So we need to fix that.

The interrupt is threaded, so it can take a mutex.
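
To illustrate the point, here is a toy userspace model of the threaded-IRQ
split. It is not kernel code and not the lm90 implementation: every name in
it (model_irq, model_mutex_lock, model_raise_irq and so on) is hypothetical.
It only assumes the usual arrangement where the hard-IRQ stage does nothing
but wake a handler thread, and the handler thread is the one that takes the
sleeping lock.

```c
#include <stdbool.h>
#include <stddef.h>

/* Toy userspace model, not kernel code: all names here are hypothetical. */

enum model_ctx { MODEL_CTX_HARDIRQ, MODEL_CTX_THREAD };

struct model_irq {
	enum model_ctx ctx;	/* context the modelled CPU is running in */
	bool thread_pending;	/* set by the hard-IRQ stage to wake the thread */
	bool mutex_held;	/* stands in for a sleeping lock such as a mutex */
	int notified;		/* events delivered by the threaded handler */
	int (*thread_fn)(struct model_irq *irq);
};

/* A mutex may sleep, so taking it is only legal outside hard-IRQ context. */
int model_mutex_lock(struct model_irq *irq)
{
	if (irq->ctx == MODEL_CTX_HARDIRQ || irq->mutex_held)
		return -1;
	irq->mutex_held = true;
	return 0;
}

void model_mutex_unlock(struct model_irq *irq)
{
	irq->mutex_held = false;
}

/* Threaded handler: runs in thread context, so the lock attempt succeeds. */
int model_notify_thread(struct model_irq *irq)
{
	if (model_mutex_lock(irq))
		return -1;
	irq->notified++;
	model_mutex_unlock(irq);
	return 0;
}

/* Hard-IRQ stage: does no work itself, it only marks the thread runnable. */
void model_raise_irq(struct model_irq *irq)
{
	irq->ctx = MODEL_CTX_HARDIRQ;
	irq->thread_pending = true;
	irq->ctx = MODEL_CTX_THREAD;
}

/* Scheduler stage: runs the pending handler in thread context. */
int model_run_irq_thread(struct model_irq *irq)
{
	if (!irq->thread_pending)
		return 0;
	irq->thread_pending = false;
	irq->ctx = MODEL_CTX_THREAD;
	return irq->thread_fn(irq);
}
```

In this model, calling model_raise_irq() and then model_run_irq_thread()
delivers the notification with the lock taken, while a direct
model_mutex_lock() call made with ctx set to MODEL_CTX_HARDIRQ is refused.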
