On Wed, Aug 18, 2021 at 03:22:19PM +0200, Greg Kroah-Hartman wrote: > On Wed, Aug 18, 2021 at 12:57:36PM +0100, Mark Brown wrote: > > The issue is that the driver core is using drivers completing probe as a > > proxy for resources becoming available. That works most of the time > > because most probes are fully synchronous but it breaks down if a > > resource provider registers resources outside of probe, we might still > > be fine if system boot is still happening and something else probes but > > only through luck. > The driver core is not using that as a proxy, that is up to the driver > itself or not. All probe means is "yes, this driver binds to this > device, thank you!" for that specific bus/class type. That's all, if > the driver needs to go off and do real work before it can properly > control the device, wonderful, have it go and do that async. Right, which is what is happening here - but the deferred probe machinery in the core is reading more into the probe succeeding than it should. > So if you know you should be binding to the device, great, kick off some > other work and return success from probe. There's no reason you have to > delay or defer for no good reason, right? The driver that's deferring isn't the one that takes a long time to probe - the driver that's deferring depends on the driver that takes a long time to probe, it defers because the resource it needs isn't available when it tries to probe as the slow device is still doing it's thing asynchronously. The problem is that the driver core isn't going back and attempting to probe the deferred device again once the driver that took a long time has provided resources. > But yes, if you do get new resources, the probe should be called again, > that's what the deferred logic is for (or is that the link logic, I > can't recall) This shouldn't be a new thing, no needing to call the > driver core directly like this at all, it should "just happen", right? How specifically does new resources becoming available directly cause a new probe deferral run at the moment? I can't see anything that resource provider APIs are doing to say that a new resource has become available, this patch is trying to provide something they can do.