linux-pci.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: Nam Cao <namcao@linutronix.de>
To: Lukas Wunner <lukas@wunner.de>
Cc: Bjorn Helgaas <bhelgaas@google.com>,
	Yinghai Lu <yinghai@kernel.org>,
	Greg Kroah-Hartman <gregkh@linuxfoundation.org>,
	Rajesh Shah <rajesh.shah@intel.com>,
	linux-pci@vger.kernel.org, linux-kernel@vger.kernel.org,
	stable@vger.kernel.org
Subject: Re: [PATCH 2/4] PCI: pciehp: bail out if pci_hp_add_bridge() fails
Date: Sat, 4 May 2024 11:35:29 +0200	[thread overview]
Message-ID: <20240504093529.p8pbGxuK@linutronix.de> (raw)
In-Reply-To: <ZjX3t1NerOlGBhzw@wunner.de>

On Sat, May 04, 2024 at 10:54:15AM +0200, Lukas Wunner wrote:
> On Fri, May 03, 2024 at 09:23:20PM +0200, Nam Cao wrote:
> > If there is no bus number available for the downstream bus of the
> > hot-plugged bridge, pci_hp_add_bridge() will fail. The driver proceeds
> > regardless, and the kernel crashes.
> > 
> > Abort if pci_hp_add_bridge() fails.
> [...]
> > --- a/drivers/pci/hotplug/pciehp_pci.c
> > +++ b/drivers/pci/hotplug/pciehp_pci.c
> > @@ -58,8 +58,13 @@ int pciehp_configure_device(struct controller *ctrl)
> >  		goto out;
> >  	}
> >  
> > -	for_each_pci_bridge(dev, parent)
> > -		pci_hp_add_bridge(dev);
> > +	for_each_pci_bridge(dev, parent) {
> > +		if (pci_hp_add_bridge(dev)) {
> > +			pci_stop_and_remove_bus_device(dev);
> > +			ret = -EINVAL;
> > +			goto out;
> > +		}
> > +	}
> 
> Is the pci_stop_and_remove_bus_device() really necessary here?
> Why not just leave the bridge as is, without any child devices?

pci_stop_and_remove_bus_device() is not necessary to prevent kernel
crashing. But without this, we cannot hot-plug any other devices to this
slot afterward, despite the bridge has already been removed. Below is what
happens without pci_stop_and_remove_bus_device().

First, we hotplug a bridge. That fails, so QEMU removes this bridge:
(qemu) device_add pci-bridge,id=br2,bus=br1,chassis_nr=19,addr=1
[    9.289609] shpchp 0000:01:00.0: Latch close on Slot(1-1)
[    9.291145] shpchp 0000:01:00.0: Button pressed on Slot(1-1)
[    9.292705] shpchp 0000:01:00.0: Card present on Slot(1-1)
[    9.294369] shpchp 0000:01:00.0: PCI slot #1-1 - powering on due to button press
[   15.529997] pci 0000:02:01.0: [1b36:0001] type 01 class 0x060400 conventional PCI bridge
[   15.533907] pci 0000:02:01.0: BAR 0 [mem 0x00000000-0x000000ff 64bit]
[   15.535802] pci 0000:02:01.0: PCI bridge to [bus 00]
[   15.538519] pci 0000:02:01.0:   bridge window [io  0x0000-0x0fff]
[   15.540261] pci 0000:02:01.0:   bridge window [mem 0x00000000-0x000fffff]
[   15.543486] pci 0000:02:01.0:   bridge window [mem 0x00000000-0x000fffff 64bit pref]
[   15.547151] pci 0000:02:01.0: No bus number available for hot-added bridge
[   15.549067] shpchp 0000:01:00.0: Cannot add device at 0000:02:01
[   15.553104] shpchp 0000:01:00.0: Latch open on Slot(1-1)
[   15.555246] shpchp 0000:01:00.0: Card not present on Slot(1-1)

Then, hot-plug an ethernet device. But the kernel still incorrectly
thought the bridge is still there, and refuses this new ethernet device:
(qemu) device_add e1000,bus=br1,addr=1
[   58.163529] shpchp 0000:01:00.0: Latch close on Slot(1-1)
[   58.165076] shpchp 0000:01:00.0: Button pressed on Slot(1-1)
[   58.166650] shpchp 0000:01:00.0: Card present on Slot(1-1)
[   58.168287] shpchp 0000:01:00.0: PCI slot #1-1 - powering on due to button press
[   64.677492] shpchp 0000:01:00.0: Device 0000:02:01.0 already exists at 0000:02:01, cannot hot-add
[   64.680007] shpchp 0000:01:00.0: Cannot add device at 0000:02:01
[   64.682802] shpchp 0000:01:00.0: Latch open on Slot(1-1)
[   64.684353] shpchp 0000:01:00.0: Card not present on Slot(1-1)

Best regards,
Nam

  reply	other threads:[~2024-05-04  9:35 UTC|newest]

Thread overview: 14+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2024-05-03 19:23 [PATCH 0/4] check returned value of pci_hp_add_bridge() Nam Cao
2024-05-03 19:23 ` [PATCH 1/4] PCI: shpchp: bail out if pci_hp_add_bridge() fails Nam Cao
2024-05-03 19:23 ` [PATCH 2/4] PCI: pciehp: " Nam Cao
2024-05-03 21:23   ` Bjorn Helgaas
2024-05-03 21:41     ` Nam Cao
2024-05-04  8:54   ` Lukas Wunner
2024-05-04  9:35     ` Nam Cao [this message]
2024-05-04  9:51       ` Lukas Wunner
2024-05-04 10:56         ` Nam Cao
2024-05-04 15:02           ` Lukas Wunner
2024-05-04 15:48             ` Nam Cao
2024-05-03 19:23 ` [PATCH 3/4] PCI: hotplug: document unchecked return value of pci_hp_add_bridge() Nam Cao
2024-05-03 19:23 ` [PATCH 4/4] PCI: hotplug: remove TODO notes for sgi_hotplug Nam Cao
2024-05-03 21:29 ` [PATCH 0/4] check returned value of pci_hp_add_bridge() Bjorn Helgaas

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=20240504093529.p8pbGxuK@linutronix.de \
    --to=namcao@linutronix.de \
    --cc=bhelgaas@google.com \
    --cc=gregkh@linuxfoundation.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=linux-pci@vger.kernel.org \
    --cc=lukas@wunner.de \
    --cc=rajesh.shah@intel.com \
    --cc=stable@vger.kernel.org \
    --cc=yinghai@kernel.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).