Re: [Tinycc-devel] vectorize the curent hash implementation

tinycc-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Tinycc-devel] vectorize the curent hash implementation

From:	Michael Matz
Subject:	Re: [Tinycc-devel] vectorize the curent hash implementation
Date:	Sat, 23 Apr 2016 20:45:37 +0200 (CEST)
User-agent:	Alpine 2.20 (LSU 67 2015-01-07)

Hi,

On Sat, 23 Apr 2016, Vladimir Vissoultchev wrote:

Rolling hashes have a nice property -- associativity -- i.e. hash(concat(a,
b)) = [hash(a) * TOK_HASH_PRIME^len(b) + hash(b)] in GF(TOK_HASH_SIZE)

What I tried based on this was to pre-hash all of file->buffer and calc hash
on current (pos, len) with no loop like this:

   h = (ph[len] - (ph[0] - TOK_HASH_INIT) * hash_prime_powers[len]) &
(TOK_HASH_SIZE - 1)

... where ph is pointing to current pos in pre-hash buffer.
Unfortunately complete file->buffer pre-hashing is slow, I'm currentlyworking on impl pre-hash part in SSE2 but it looks like it's going to betoo slow again as there is no performant way to calculate allintermediate hashes.

The hashing of identifiers (when tcc is compiled with GCC on -O0) innext_nomacro1 itself takes only 2.3 % of the overall cycle estimate(valgrind cachegrind measurement of compiling tcc with tcc itself). Thestepping forward of the characters itself takes more than the hashing(namely 3.3%). So any optimization of the hashing needs to be _extremely_efficient to lead to any measurable improvement at all. I.e. this loop:


3.3%        while (c = *++p, isidnum_table[c - CH_EOF] & (IS_ID|IS_NUM))
2.3%            h = TOK_HASH_FUNC(h, c);

Even optimizing this loop to run twice as fast (which would be quite anachievement) will make tcc run only 2.8% faster (nothing to sneeze at ofcourse, I just wanted to mention that it's not easy to improve the speedby an fantastic amount).



Ciao,
Michael.

[Prev in Thread]

Current Thread

[Next in Thread]

[Tinycc-devel] vectorize the curent hash implementation, Sergey Korshunoff, 2016/04/23
- Re: [Tinycc-devel] vectorize the curent hash implementation, Vladimir Vissoultchev, 2016/04/23
  - Re: [Tinycc-devel] vectorize the curent hash implementation, KHMan, 2016/04/23
    - Re: [Tinycc-devel] vectorize the curent hash implementation, Vladimir Vissoultchev, 2016/04/23
    - Re: [Tinycc-devel] vectorize the curent hash implementation, KHMan, 2016/04/23
    - Re: [Tinycc-devel] vectorize the curent hash implementation, Michael Matz <=
    - Re: [Tinycc-devel] vectorize the curent hash implementation, Michael B. Smith, 2016/04/25

Prev by Date: Re: [Tinycc-devel] Development style
Next by Date: Re: [Tinycc-devel] First kcachegrind output
Previous by thread: Re: [Tinycc-devel] vectorize the curent hash implementation
Next by thread: Re: [Tinycc-devel] vectorize the curent hash implementation
Index(es):
- Date
- Thread