From mboxrd@z Thu Jan 1 00:00:00 1970 From: Richard Palethorpe Date: Wed, 20 Jan 2021 10:00:14 +0000 Subject: [LTP] [PATCH 1/1] fzsync: Add sched_yield for single core machine In-Reply-To: <20210120070053.11490-1-ycliang@andestech.com> References: <20210120070053.11490-1-ycliang@andestech.com> Message-ID: <87sg6w9bdd.fsf@suse.de> List-Id: MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit To: ltp@lists.linux.it Hello Leo, Leo Yu-Chi Liang writes: > Fuzzy sync library uses spin waiting mechanism > to implement thread barrier behavior, which would > cause this test to be time-consuming on single core machine. > > Fix this by adding sched_yield in the spin waiting loop, > so that the thread yields cpu as soon as it enters the waiting loop. Thanks for sending this in. Comments below. > > Signed-off-by: Leo Yu-Chi Liang > --- > include/tst_fuzzy_sync.h | 6 ++++++ > 1 file changed, 6 insertions(+) > > diff --git a/include/tst_fuzzy_sync.h b/include/tst_fuzzy_sync.h > index 4141f5c64..64d172681 100644 > --- a/include/tst_fuzzy_sync.h > +++ b/include/tst_fuzzy_sync.h > @@ -59,9 +59,11 @@ > * @sa tst_fzsync_pair > */ > > +#include > #include > #include > #include > +#include > #include > #include > #include "tst_atomic.h" > @@ -564,6 +566,8 @@ static inline void tst_fzsync_pair_wait(int *our_cntr, > && tst_atomic_load(our_cntr) < INT_MAX) { > if (spins) > (*spins)++; > + if(get_nprocs() == 1) We should use tst_ncpus() and then cache the value so we are not making a function call within the loop. It is probably best to avoid calling this function inside tst_fzsync_pair_wait, it may even result in a system call. We should probably cache the value in tst_fzsync_pair, maybe as a boolean e.g. "yield_in_wait". This can be set/checked in the tst_fzsync_pair_init function. Also this will allow the user to handle CPUs being offlined if the test itself can cause that. > + sched_yield(); > } > > tst_atomic_store(0, other_cntr); > @@ -581,6 +585,8 @@ static inline void tst_fzsync_pair_wait(int *our_cntr, > while (tst_atomic_load(our_cntr) < tst_atomic_load(other_cntr)) { > if (spins) > (*spins)++; > + if(get_nprocs() == 1) > + sched_yield(); > } > } > } Everyone please note that we will have to test this extensively to ensure it does break existing reproducers. Alternatively to this approach we could create seperate implementations of pair_wait and use a function pointer. I am thinking it may be best to do it both ways and perform some measurements. -- Thank you, Richard.