[Qemu-devel] [PATCH v6 00/50] tcg tb_lock removal

* [Qemu-devel] [PATCH v6 00/50] tcg tb_lock removal
@ 2017-10-16 17:25 Richard Henderson
  2017-10-16 17:25 ` [Qemu-devel] [PATCH v6 01/50] tcg: Merge opcode arguments into TCGOp Richard Henderson
                   ` (52 more replies)
  0 siblings, 53 replies; 94+ messages in thread
From: Richard Henderson @ 2017-10-16 17:25 UTC (permalink / raw)
  To: qemu-devel; +Cc: cota

I've fixed two bugs within v5 of Emilio's patch set:

 - The step_next_tb patch fixes the "rep movsb" bug that appeared
   when we included CF_COUNT_MASK into CF_HASH_MASK.  We had been
   relying on magic to single-step the next guest insn.

 - The original "allocate optimizer temps with tcg_malloc" patch
   failed testing on arm32 host.  I didn't really look into exactly
   what was wrong because I had an older patch set that touched the
   same portion of the optimizer.

   Thus, an extra 20 patches in the patch set rearranging how temps
   are referenced within the TCG backend.  Some of them have appeared
   on the list before, but it would have been last November.

r~

Emilio G. Cota (24):
  tcg: define CF_PARALLEL and use it for TB hashing along with
    CF_COUNT_MASK
  tcg: convert tb->cflags reads to tb_cflags(tb)
  target/arm: check CF_PARALLEL instead of parallel_cpus
  target/hppa: check CF_PARALLEL instead of parallel_cpus
  target/i386: check CF_PARALLEL instead of parallel_cpus
  target/m68k: check CF_PARALLEL instead of parallel_cpus
  target/s390x: check CF_PARALLEL instead of parallel_cpus
  target/sh4: check CF_PARALLEL instead of parallel_cpus
  target/sparc: check CF_PARALLEL instead of parallel_cpus
  tcg: check CF_PARALLEL instead of parallel_cpus
  cpu-exec: lookup/generate TB outside exclusive region during
    step_atomic
  translate-all: use a binary search tree to track TBs in TBContext
  exec-all: rename tb_free to tb_remove
  translate-all: report correct avg host TB size
  tcg: take tb_ctx out of TCGContext
  tcg: define tcg_init_ctx and make tcg_ctx a pointer
  gen-icount: fold exitreq_label into TCGContext
  tcg: introduce **tcg_ctxs to keep track of all TCGContext's
  tcg: distribute profiling counters across TCGContext's
  tcg: allocate optimizer temps with tcg_malloc
  osdep: introduce qemu_mprotect_rwx/none
  translate-all: use qemu_protect_rwx/none helpers
  tcg: introduce regions to split code_gen_buffer
  tcg: enable multiple TCG contexts in softmmu

Richard Henderson (26):
  tcg: Merge opcode arguments into TCGOp
  tcg: Propagate args to op->args in optimizer
  tcg: Propagate args to op->args in tcg.c
  tcg: Propagate TCGOp down to allocators
  tcg: Introduce arg_temp
  tcg: Add temp_global bit to TCGTemp
  tcg: Return NULL temp for TCG_CALL_DUMMY_ARG
  tcg: Introduce temp_arg
  tcg: Use per-temp state data in liveness
  tcg: Avoid loops against variable bounds
  tcg: Change temp_allocate_frame arg to TCGTemp
  tcg: Remove unused TCG_CALL_DUMMY_TCGV
  tcg: Export temp_idx
  tcg: Use per-temp state data in optimize
  tcg: Push tcg_ctx into generator functions
  tcg: Push tcg_ctx into tcg_gen_callN
  tcg: Introduce index_arg
  tcg: Reserve temporary index 0
  target/alpha: Avoid translate_init unless tcg_enabled
  qom: Introduce CPUClass.tcg_initialize
  tcg: Use pointers in TCGOp->args
  hack dump tb->flags and tb->cflags
  tcg: Add CPUState step_next_tb
  tcg: Include CF_COUNT_MASK in CF_HASH_MASK
  tcg: Add CF_LAST_IO + CF_USE_ICOUNT to CF_HASH_MASK
  tcg: Remove CF_IGNORE_ICOUNT

 include/exec/exec-all.h       |   41 +-
 include/exec/gen-icount.h     |   25 +-
 include/exec/helper-gen.h     |   12 +-
 include/exec/tb-context.h     |    6 +-
 include/exec/tb-hash-xx.h     |    9 +-
 include/exec/tb-hash.h        |    4 +-
 include/exec/tb-lookup.h      |    6 +-
 include/qemu/osdep.h          |    2 +
 include/qom/cpu.h             |    9 +-
 target/arm/helper-a64.h       |    4 +
 target/hppa/helper.h          |    2 +
 target/m68k/helper.h          |    1 +
 target/s390x/helper.h         |    4 +
 target/sparc/cpu.h            |    2 +-
 tcg/tcg-op.h                  |  132 +++---
 tcg/tcg.h                     |  184 +++++---
 accel/tcg/cpu-exec.c          |  102 ++--
 accel/tcg/tcg-runtime.c       |    4 +-
 accel/tcg/translate-all.c     |  511 ++++++++++----------
 accel/tcg/translator.c        |    4 +-
 bsd-user/main.c               |    3 +-
 cpus.c                        |   14 +
 exec.c                        |   13 +-
 linux-user/main.c             |    9 +-
 linux-user/syscall.c          |    1 +
 target/alpha/cpu.c            |    3 +-
 target/alpha/translate.c      |   12 +-
 target/arm/cpu.c              |    6 +-
 target/arm/helper-a64.c       |   38 +-
 target/arm/op_helper.c        |    7 -
 target/arm/translate-a64.c    |   38 +-
 target/arm/translate.c        |   17 +-
 target/cris/cpu.c             |   16 +-
 target/cris/translate.c       |    8 +-
 target/cris/translate_v10.c   |    2 +-
 target/hppa/cpu.c             |    3 +-
 target/hppa/op_helper.c       |   32 +-
 target/hppa/translate.c       |   22 +-
 target/i386/cpu.c             |    5 +-
 target/i386/translate.c       |   60 ++-
 target/lm32/cpu.c             |    7 +-
 target/lm32/translate.c       |   16 +-
 target/m68k/cpu.c             |    7 +-
 target/m68k/op_helper.c       |   33 +-
 target/m68k/translate.c       |   20 +-
 target/microblaze/cpu.c       |    7 +-
 target/microblaze/translate.c |    8 +-
 target/mips/cpu.c             |    5 +-
 target/mips/translate.c       |   35 +-
 target/moxie/cpu.c            |    7 +-
 target/moxie/translate.c      |   10 +-
 target/nios2/cpu.c            |    7 +-
 target/nios2/translate.c      |    6 +-
 target/openrisc/cpu.c         |    7 +-
 target/openrisc/translate.c   |    8 +-
 target/ppc/translate.c        |   14 +-
 target/ppc/translate_init.c   |   37 +-
 target/s390x/cpu.c            |    7 +-
 target/s390x/mem_helper.c     |   80 +++-
 target/s390x/translate.c      |   36 +-
 target/sh4/cpu.c              |    5 +-
 target/sh4/translate.c        |   17 +-
 target/sparc/cpu.c            |    5 +-
 target/sparc/translate.c      |   19 +-
 target/tilegx/cpu.c           |    7 +-
 target/tilegx/translate.c     |    4 +-
 target/tricore/cpu.c          |    5 +-
 target/tricore/translate.c    |    9 +-
 target/unicore32/cpu.c        |    7 +-
 target/unicore32/translate.c  |    8 +-
 target/xtensa/cpu.c           |    7 +-
 target/xtensa/translate.c     |   30 +-
 tcg/optimize.c                |  659 ++++++++++++++------------
 tcg/tcg-op.c                  |  186 ++++----
 tcg/tcg.c                     | 1030 ++++++++++++++++++++++++++++-------------
 tests/qht-bench.c             |    2 +-
 util/osdep.c                  |   41 ++
 77 files changed, 2194 insertions(+), 1577 deletions(-)

-- 
2.13.6

^ permalink raw reply	[flat|nested] 94+ messages in thread