From: Dmitri A. Sergatskov
Subject: Re: Octave 64-bit indexing built with ATLAS
Date: Mon, 11 Sep 2017 17:51:20 -0500
octave_mflops.m:

N = 400;
for i = 1:N
  n = 4096;
  x = rand(n, n);
  tic, x = x .* x;
  y = toc;
  mflops(i) = n*n / y / 1e6;
end
mflops_mean = sum(mflops)/N;
mflops_sig = std(mflops);
printf('MFLOPS: (%.2f +- %.2f)\n', mflops_mean, mflops_sig);

Now the funny part is the following, which confuses me a little bit. For the OpenBLAS build and the ATLAS build, I receive the following values:

MFLOPS: (370.47 +- 7.60) (OpenBLAS)
MFLOPS: (370.43 +- 7.27) (ATLAS)

I expected ATLAS to be faster than OpenBLAS. Also, when monitoring the load with 'htop', only one CPU is fully loaded. I expected ATLAS to use parallel threads, which I tried to ensure by using libtatlas_Oct64.so. Am I expecting something wrong? Can someone explain to me what I did wrong?
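
One thing that may be worth checking, offered as a sketch rather than a diagnosis: the loop above times `x .* x`, an element-wise product, which Octave evaluates in its own loops rather than handing to a BLAS routine, so the choice of BLAS library would not be expected to change it. The variant below times the matrix product `a * b` instead, which for full double matrices goes through the BLAS GEMM routine. The size n = 1024, the repeat count N = 10, and the 2*n^3 flop estimate are assumptions chosen for illustration and are not from the original script.

```octave
% Hypothetical variant of octave_mflops.m that times a matrix product.
n = 1024;               % assumed matrix size; increase for longer runs
N = 10;                 % assumed number of repetitions
mflops = zeros(1, N);   % preallocate the result vector

for i = 1:N
  a = rand(n, n);
  b = rand(n, n);
  tic;
  c = a * b;            % matrix product (not element-wise), uses BLAS GEMM
  t = toc;
  % A dense n-by-n matrix product costs roughly 2*n^3 floating-point operations.
  mflops(i) = 2 * n^3 / t / 1e6;
end

printf('MFLOPS: (%.2f +- %.2f)\n', mean(mflops), std(mflops));
```

If the two builds differ on this loop but not on the element-wise one, that would suggest the BLAS library is linked correctly and the original benchmark simply never reaches it.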