On Thu, Sep 3, 2020 at 7:28 AM Al Viro wrote:
>
> I can live with this series; do you want that in vfs.git#for-next?

Well, it's apparently there now (at least it's in your base.set_fs branch, I didn't check actual -next).

So this is just a heads-up that I plan to merge the "asm goto" changes on top of this during 5.10. Nick did the patch to make my patch-set work either with or without the actual asm goto support, and I've been running it privately now for several months.

And apparently there are people working on this on the gcc side too, so it won't just be clang-specific. Nor kernel-specific in that Nick tells me some other projects are looking at using that asm goto with outputs too.

Anyway, the actual patch to use asm goto with outputs is fairly small and not that interesting to people (since no released compiler supports it), but part of the infrastructure to make it tiny is to just get rid of the inlined "__get_user()" and "__put_user()" stuff. I've ranted against those functions for a few years by now, so part of this is to stop inlining them and making people think they are "good", but part of it is also that those macros and inline functions are the main remaining ones that mess with this all.

I'm attaching the two __get_user/__put_user patches here in case anybody cares, but these are the pre-rebased ones, I'll make them work with the new world order as it happens. The main change is:

 (a) unify around a common special calling convention:
   - %bx is clobbered
   - %cx contains the user address on input, and the error value on output
   - %ax/%dx contains the actual value (input for put, output for get, of course)

 (b) unify around using just a "call", using the model that get/put_user already did.
   - use "*_nocheck" for the double-underscore versions
   - this still has to use inline asm because the calling convention is odd
   - otherwise basically just a "call __{get,put}_user_[nocheck_]X" where X is the size.

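
To make (a) and (b) a bit more concrete, here is a rough userspace model in plain C. It is only a sketch and not the attached patches: the real helpers are reached through inline asm with the register convention above, while this model returns a small struct whose "err" field plays the role of %cx on output and whose "val" field plays the role of %ax/%dx. Every name in it (fake_user_mem, uaccess_result, model_access_ok, model_get_user_X, model_get_user_nocheck_X) is hypothetical and exists only for illustration; the put side would look analogous.

```c
#include <assert.h>
#include <errno.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for user memory; not a kernel structure. */
static unsigned char fake_user_mem[64];

/* Models the outputs: err is what %cx would hold, val what %ax/%dx would. */
struct uaccess_result {
	long err;	/* 0 on success, -EFAULT on a rejected address */
	uint64_t val;	/* the value that was read */
};

/* Hypothetical range check standing in for the address check. */
int model_access_ok(const void *p, size_t size)
{
	uintptr_t start = (uintptr_t)fake_user_mem;
	uintptr_t addr = (uintptr_t)p;

	return size <= sizeof(fake_user_mem) && addr >= start &&
	       addr - start <= sizeof(fake_user_mem) - size;
}

/*
 * One out-of-line helper pair per size X: the "_nocheck_" flavour models
 * the double-underscore version, the plain one does the check itself.
 */
#define DEFINE_MODEL_GET_USER(X, type)					\
struct uaccess_result model_get_user_nocheck_##X(const void *p)	\
{									\
	struct uaccess_result r = { 0, 0 };				\
	type v;								\
	memcpy(&v, p, sizeof(v));					\
	r.val = v;							\
	return r;							\
}									\
struct uaccess_result model_get_user_##X(const void *p)		\
{									\
	struct uaccess_result r = { -EFAULT, 0 };			\
	if (!model_access_ok(p, sizeof(type)))				\
		return r;						\
	return model_get_user_nocheck_##X(p);				\
}

DEFINE_MODEL_GET_USER(1, uint8_t)
DEFINE_MODEL_GET_USER(2, uint16_t)
DEFINE_MODEL_GET_USER(4, uint32_t)
DEFINE_MODEL_GET_USER(8, uint64_t)

/* Example use: the checked helper rejects an address outside the region. */
void model_example(void)
{
	uint32_t in = 0x11223344;
	uint32_t outside = 7;

	memcpy(fake_user_mem + 8, &in, sizeof(in));
	assert(model_get_user_4(fake_user_mem + 8).val == 0x11223344);
	assert(model_get_user_4(&outside).err == -EFAULT);
}
```

The only point of the model is the shape: one helper per size, one way of reporting the error, and the nocheck flavour differing only in skipping the address check.
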
IOW, we unify around one single calling convention, and one single model for actually getting things done.

I still want to remove the double-underscore versions entirely some day - they have absolutely zero advantages compared to the full "do address_ok as part of the operation" - but that's a separate thing. At least they can be unified.

And the reason for this all is obviously that now *only* the "unsafe_{get,put}_user()" cases with the error label output are the "fast inlined" cases. They are the only ones that _can_ be done quickly inline, since the slow clac/stac is not part of them. Plus they already have that unified usage model of the error label, even if unsafe_get_user() currently does it manually because "asm goto" with outputs doesn't work in existing compilers.

Comments? I suspect people won't care, but I thought I'd post these so that there won't be any surprises during the next merge window when I apply them after merging the set_fs() removal branch.

                Linus
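
For readers unfamiliar with the error-label usage model mentioned above, a minimal userspace sketch in plain C follows. It is an illustration under stated assumptions, not kernel code: model_region, model_fault, model_unsafe_get_user and model_read_pair are hypothetical names, the "fault" is simulated with a range check, and the jump to the label is done by hand with an ordinary goto, roughly in the spirit of the manual approach the mail describes for compilers without "asm goto" outputs.

```c
#include <errno.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for user memory. */
static unsigned char model_region[32];

/* Simulated fault: nonzero when the access would leave the region. */
int model_fault(const void *p, size_t size)
{
	uintptr_t start = (uintptr_t)model_region;
	uintptr_t addr = (uintptr_t)p;

	return !(size <= sizeof(model_region) && addr >= start &&
		 addr - start <= sizeof(model_region) - size);
}

/* Error-label model: on failure, jump straight to the caller's label. */
#define model_unsafe_get_user(x, ptr, label)			\
	do {							\
		if (model_fault((ptr), sizeof(x)))		\
			goto label;				\
		memcpy(&(x), (ptr), sizeof(x));			\
	} while (0)

/* Reads two adjacent fields; one label covers both accesses. */
int model_read_pair(const void *uptr, uint32_t *a, uint32_t *b)
{
	const unsigned char *p = uptr;
	uint32_t x, y;

	/* A real caller would open the user-access window (stac) here... */
	model_unsafe_get_user(x, p, efault);
	model_unsafe_get_user(y, p + sizeof(x), efault);
	/* ...and close it (clac) here on success. */
	*a = x;
	*b = y;
	return 0;

efault:
	/* The window would also be closed on this path. */
	return -EFAULT;
}
```

Because the slow part sits outside the individual accesses, each access in the sketch is just a check and a load, which is why this is the case that inlines well.
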