[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
bug#30020: floating point unboxing regression in 2.2.3
From: |
Mark H Weaver |
Subject: |
bug#30020: floating point unboxing regression in 2.2.3 |
Date: |
Mon, 28 May 2018 02:21:37 -0400 |
User-agent: |
Gnus/5.13 (Gnus v5.13) Emacs/25.3 (gnu/linux) |
Hi David,
Sorry for the long delay.
"Thompson, David" <address@hidden> writes:
> Guile 2.2.3 seems to have lost some of the abilities that Guile 2.2.2
> had wrt unboxing floats. Here's a simple procedure to show the
> problem. It simply adds the first two elements of an f32vector:
>
> (define (add-two-floats bv)
> (+ (f32vector-ref bv 0) (f32vector-ref bv 1)))
Thanks for the tiny test case. This helps a lot!
> Here's the disassembly from 2.2.2 (note that f64->scm appears only once):
>
> Disassembly of #<procedure add-two-floats (bv)> at #x7efef4006230:
>
> 0 (assert-nargs-ee/locals 2 1) ;; 3 slots (1 arg) at
> (unknown file):22:0
> 1 (load-u64 2 0 0) at
> (unknown file):23:26
> 4 (bv-f32-ref 2 1 2)
> 5 (load-u64 0 0 4) at
> (unknown file):23:47
> 8 (bv-f32-ref 1 1 0)
> 9 (fadd 2 2 1) at
> (unknown file):23:23
> 10 (f64->scm 1 2)
> 11 (handle-interrupts)
> 12 (return-values 2) ;; 1 value
>
> And here is 2.2.3:
>
> Disassembly of #<procedure add-two-floats (bv)> at #x2457140:
>
> 0 (assert-nargs-ee/locals 2 1) ;; 3 slots (1 arg) at
> (unknown file):29:0
> 1 (load-u64 2 0 0) at
> (unknown file):30:26
> 4 (bv-f32-ref 2 1 2)
> 5 (f64->scm 2 2)
> 6 (load-u64 0 0 4) at
> (unknown file):30:47
> 9 (bv-f32-ref 1 1 0)
> 10 (f64->scm 1 1)
> 11 (scm->f64 2 2) at
> (unknown file):30:23
> 12 (scm->f64 1 1)
> 13 (fadd 2 2 1)
> 14 (f64->scm 1 2)
> 15 (handle-interrupts)
> 16 (return-values 2) ;; 1 value
>
> 2.2.3 is reading unboxed floats from the bytevector, boxing them,
> unboxing them, then calling fadd. The result is that some formerly
> well optimized code of mine that generated little to no garbage is now
> generating lots of garbage.
I did the moral equivalent of a git bisect, and found the culprit:
commit d4883307ca64a7028b9a6cd072974437306c19d3
Author: Andy Wingo <address@hidden>
Date: Thu Nov 30 10:41:45 2017 +0100
Minor CSE run-time optimization
* module/language/cps/cse.scm (compute-equivalent-subexpressions): Minor
optimization to reduce the size of equivalent expression keys, and to
avoid some work if an expression has no key.
Fortunately, this commit can be reverted in isolation without any
difficulty, and apparently without any negative consequences. If a
better solution isn't found soon, perhaps that's what we should do.
Andy, do you have any idea what's going on here?
Thanks,
Mark