qemu-devel

Re: [PATCH 00/20] tcg: vector improvements


From: Richard Henderson
Subject: Re: [PATCH 00/20] tcg: vector improvements
Date: Sat, 29 Jan 2022 20:28:21 +1100
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.5.0

Ping?

Patch 1 is now upstream, but only patches 2-4 have reviews.
It applies cleanly to master...


r~

On 12/19/21 06:42, Richard Henderson wrote:
Add some opcodes for compound logic operations that were so
far marked as TODO.  Implement those for PPC and S390X.
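
For readers unfamiliar with the three compound operations named above, here is a minimal sketch of their bitwise meaning on a single 64-bit lane. It is not code from the series: the `ref_*` helper names are hypothetical and chosen only for illustration. A vector opcode would apply the same operation to every element.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Reference semantics for one 64-bit lane. The ref_* names are
 * illustrative only and do not appear in the patch series.
 */
static inline uint64_t ref_nand(uint64_t a, uint64_t b)
{
    return ~(a & b);    /* NAND: complement of AND */
}

static inline uint64_t ref_nor(uint64_t a, uint64_t b)
{
    return ~(a | b);    /* NOR: complement of OR */
}

static inline uint64_t ref_eqv(uint64_t a, uint64_t b)
{
    return ~(a ^ b);    /* EQV: complement of XOR, bits set where a == b */
}

/* Small self-check of a few identities that follow from the definitions. */
static inline void ref_logic_self_check(void)
{
    uint64_t x = 0x0123456789abcdefULL;

    assert(ref_nand(x, x) == ~x);           /* x NAND x == NOT x */
    assert(ref_nor(x, x) == ~x);            /* x NOR x  == NOT x */
    assert(ref_eqv(x, x) == ~(uint64_t)0);  /* x EQV x  == all ones */
}
```

Without dedicated opcodes, each of these would take two operations (the base logic op followed by a NOT), which is presumably why hosts with native instructions benefit from having them expressed directly.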

We do not want to implement 512-bit width operations, because
those trigger a cluster clock slowdown on the current set of
Intel CPUs.  But there are new operations in AVX512 that apply
to 128-bit and 256-bit vectors, which do not trigger the slowdown,
and those are very interesting.
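
As a rough illustration of the policy described above (use the new instructions, but only at 128-bit and 256-bit widths), the sketch below checks whether the host reports both AVX512F and AVX512VL, the extension that makes AVX512 instructions available at the narrower widths. It relies on the GCC/Clang builtins `__builtin_cpu_init` and `__builtin_cpu_supports`, which is an assumption made for brevity and not how the series itself performs detection; the function name `host_has_avx512_narrow` is hypothetical.

```c
#include <stdbool.h>

/*
 * Hypothetical helper, not taken from the series: report whether
 * AVX512 instructions could be used on 128-bit and 256-bit vectors.
 * Uses compiler builtins (GCC/Clang, x86 only) instead of raw CPUID.
 */
static bool host_has_avx512_narrow(void)
{
#if (defined(__x86_64__) || defined(__i386__)) && \
    (defined(__GNUC__) || defined(__clang__))
    __builtin_cpu_init();
    /* AVX512F is the foundation; AVX512VL adds the 128/256-bit forms. */
    return __builtin_cpu_supports("avx512f") != 0
        && __builtin_cpu_supports("avx512vl") != 0;
#else
    /* Other hosts or compilers: conservatively report no support. */
    return false;
#endif
}
```

A backend following this approach would gate each new vector operation on such a check and keep its maximum vector width at 256 bits, so that no 512-bit instruction is ever emitted.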


r~


Richard Henderson (20):
   tcg/optimize: Fix folding of vector ops
   tcg: Add opcodes for vector nand, nor, eqv
   tcg/ppc: Implement vector NAND, NOR, EQV
   tcg/s390x: Implement vector NAND, NOR, EQV
   tcg/i386: Detect AVX512
   tcg/i386: Add tcg_out_evex_opc
   tcg/i386: Use tcg_can_emit_vec_op in expand_vec_cmp_noinv
   tcg/i386: Implement avx512 variable shifts
   tcg/i386: Implement avx512 scalar shift
   tcg/i386: Implement avx512 immediate sari shift
   tcg/i386: Implement avx512 immediate rotate
   tcg/i386: Implement avx512 variable rotate
   tcg/i386: Support avx512vbmi2 vector shift-double instructions
   tcg/i386: Expand vector word rotate as avx512vbmi2 shift-double
   tcg/i386: Remove rotls_vec from tcg_target_op_def
   tcg/i386: Expand scalar rotate with avx512 insns
   tcg/i386: Implement avx512 min/max/abs
   tcg/i386: Implement avx512 multiply
   tcg/i386: Implement more logical operations for avx512
   tcg/i386: Implement bitsel for avx512

  include/qemu/cpuid.h          |  20 +-
  include/tcg/tcg-opc.h         |   3 +
  include/tcg/tcg.h             |   3 +
  tcg/aarch64/tcg-target.h      |   3 +
  tcg/arm/tcg-target.h          |   3 +
  tcg/i386/tcg-target-con-set.h |   1 +
  tcg/i386/tcg-target.h         |  17 +-
  tcg/i386/tcg-target.opc.h     |   3 +
  tcg/ppc/tcg-target.h          |   3 +
  tcg/s390x/tcg-target.h        |   3 +
  tcg/optimize.c                |  61 ++++--
  tcg/tcg-op-vec.c              |  27 ++-
  tcg/tcg.c                     |   6 +
  tcg/i386/tcg-target.c.inc     | 386 ++++++++++++++++++++++++++++------
  tcg/ppc/tcg-target.c.inc      |  15 ++
  tcg/s390x/tcg-target.c.inc    |  17 ++
  16 files changed, 472 insertions(+), 99 deletions(-)




