Re: [PATCH v2] target/arm/arch

qemu-arm

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2] target/arm/arch_dump: Add SVE notes

From:	Richard Henderson
Subject:	Re: [PATCH v2] target/arm/arch_dump: Add SVE notes
Date:	Wed, 9 Oct 2019 20:39:21 -0400
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0

On 10/4/19 8:03 AM, Andrew Jones wrote:
> +#ifdef TARGET_AARCH64
> +static off_t sve_zreg_offset(uint32_t vq, int n)
> +{
> +    off_t off = sizeof(struct aarch64_user_sve_header);
> +    return ROUND_UP(off, 16) + vq * 16 * n;
> +}
> +static off_t sve_preg_offset(uint32_t vq, int n)
> +{
> +    return sve_zreg_offset(vq, 32) + vq * 16 / 8 * n;
> +}
> +static off_t sve_fpsr_offset(uint32_t vq)
> +{
> +    off_t off = sve_preg_offset(vq, 17) + offsetof(struct aarch64_note, sve);
> +    return ROUND_UP(off, 16) - offsetof(struct aarch64_note, sve);
> +}
> +static off_t sve_fpcr_offset(uint32_t vq)
> +{
> +    return sve_fpsr_offset(vq) + sizeof(uint32_t);
> +}
> +static uint32_t sve_current_vq(CPUARMState *env)
> +{
> +    return sve_zcr_len_for_el(env, arm_current_el(env)) + 1;
> +}
> +static size_t sve_size_vq(uint32_t vq)
> +{
> +    off_t off = sve_fpcr_offset(vq) + sizeof(uint32_t) +
> +                offsetof(struct aarch64_note, sve);
> +    return ROUND_UP(off, 16) - offsetof(struct aarch64_note, sve);
> +}
> +static size_t sve_size(CPUARMState *env)
> +{
> +    return sve_size_vq(sve_current_vq(env));
> +}

Watch the missing spaces between functions.

> +    for (i = 0; i < 32; ++i) {
> +#ifdef HOST_WORDS_BIGENDIAN
> +        uint64_t d[vq * 2];
> +        int j;
> +
> +        for (j = 0; j < vq * 2; ++j) {
> +            d[j] = bswap64(env->vfp.zregs[i].d[j]);
> +        }
> +#else
> +        uint64_t *d = &env->vfp.zregs[i].d[0];
> +#endif
> +        memcpy(&buf[sve_zreg_offset(vq, i)], &d[0], vq * 16);
> +    }

We should avoid the variable sized array here.

It might be best to avoid the ifdef altogether:

    for (i = 0; i < 32; ++i) {
        uint64_t *d = (uint64_t *)&buf[sve_zreg_offset(vq, i)];
        for (j = 0; j < vq * 2; ++j) {
            d[j] = cpu_to_le64(env->vfp.zregs[i].d[j]);
        }
    }

The compiler may well transform the inner loop to memcpy for little-endian
host, but even if it doesn't core dumping is hardly performance sensitive.



r~

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH v2] target/arm/arch_dump: Add SVE notes, Andrew Jones, 2019/10/04
- Re: [PATCH v2] target/arm/arch_dump: Add SVE notes, no-reply, 2019/10/04
- Re: [PATCH v2] target/arm/arch_dump: Add SVE notes, Richard Henderson <=
  - Re: [PATCH v2] target/arm/arch_dump: Add SVE notes, Andrew Jones, 2019/10/10
    - Re: [PATCH v2] target/arm/arch_dump: Add SVE notes, Richard Henderson, 2019/10/10
    - Re: [PATCH v2] target/arm/arch_dump: Add SVE notes, Andrew Jones, 2019/10/16
    - Re: [PATCH v2] target/arm/arch_dump: Add SVE notes, Richard Henderson, 2019/10/16

Prev by Date: Re: [PATCH v2 0/8] hw: Convert various reset() handler to DeviceReset
Next by Date: Re: [RFC PATCH 2/5] timer: arm: Introduce functions to get the host cntfrq
Previous by thread: Re: [PATCH v2] target/arm/arch_dump: Add SVE notes
Next by thread: Re: [PATCH v2] target/arm/arch_dump: Add SVE notes
Index(es):
- Date
- Thread