Re: Semaphore assembly-code bug

From: Andi Kleen @ 2004-10-30  2:04 UTC
  To: dean gaudet
  Cc: linux-os, Andreas Steinmetz, Richard Henderson, Andi Kleen,
	Andrew Morton, Jan Hubicka, linux-kernel, torvalds

dean gaudet <dean-list-linux-kernel@arctic.org> writes:
> 
> it's worse than that in general -- lea typically goes through the AGU 
> which has either less throughput or longer latency than the ALUs... 
> depending on which x86en.  it's 4 cycles for a lea on p4, vs. 1 for a pop.  
> it's 2 cycles for a lea on k8 vs. 1 for a pop.

On D-stepping and later K8s the lea has 1-cycle latency, because the
decoder optimizes the lea into an add.

-Andi
