|
From: | Elijah Stone |
Subject: | Re: [Tinycc-devel] Optimizing for avx512 |
Date: | Sun, 6 Feb 2022 02:16:23 -0800 (PST) |
On Sun, 6 Feb 2022, Samir Ribić via Tinycc-devel wrote:
A bit faster than VMOVUPS is VMOVAPS, but the numbers must be at addresses divisible by 64.
They are the same speed. VMOVUPS will go slowly if you give it an unaligned address, while VMOVAPS will fault; but they do exactly the same thing given an aligned address. That said, if you do intend to use aligned addresses for performance, you should probably use the 'aligned' instruction variants where you can, to check your work.
-E
[Prev in Thread] | Current Thread | [Next in Thread] |