[Top][All Lists]
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-gnubg] Some idle musings re. ratings
From: |
Joern Thyssen |
Subject: |
Re: [Bug-gnubg] Some idle musings re. ratings |
Date: |
Thu, 11 Sep 2003 09:23:21 +0000 |
User-agent: |
Mutt/1.4.1i |
On Thu, Sep 11, 2003 at 01:38:01AM +0200, Jim Segrave wrote
>
> This is not a realistic model in that no-one ever plays with a
> consistent MWC, but I was wondering if we could use the luck adjusted
> result as an indicator of MWC to model FIBS ratings and to compare them
> to the current function which Kees van Doel generated.
I think that is exactly what what Kees did.
He calculated relative rating estimates of gnubg 0-ply versus gnubg
0-ply with noise and found a relationship between error rates and the
relative rating estimates.
Rethinking, I would have prefered to relate the error rates and MWCs,
and then apply the FIBS rating formulae afterwards.
One of the issues raised by Douglas is if the experiment of 0-ply versus
0-ply noise really represents a good model of human play.
Also, what relationship do we expect between error rates and lost MWC?
My guess would be linear for some fixed match length, but it depends on
if you make the errors in the beginning or towards the end of a match.
Using 0-ply with noise will roughly give constant error rates throughout
the match, so I'd expect to see a linear relationship between error
rates and lost MWC for a fixed match length. I'm not sure if this holds
for human play -- I guess you would need to investigate a large number
of human matches.
The FIBs rating formula is:
D = -2000/sqrt(n) * log10( 1/p - 1 )
For values of p around 0.5 the rating difference D is close to linear in
p, so we would get a linear relationship between D and error rates --
exactly that Kees found.
Kees uses a a/N+b extrapolation formulae for the match length. I would
guess from the FIBS rating formulae that a/sqrt(N)+b would be better.
Jørn
pgpk_KHIhVed1.pgp
Description: PGP signature
- [Bug-gnubg] Some idle musings re. ratings, Jim Segrave, 2003/09/10
- Re: [Bug-gnubg] Some idle musings re. ratings,
Joern Thyssen <=
- Re: [Bug-gnubg] Some idle musings re. ratings, kvandoel, 2003/09/11
- RE: [Bug-gnubg] Some idle musings re. ratings, David Montgomery, 2003/09/11
- Re: [Bug-gnubg] Some idle musings re. ratings, Jim Segrave, 2003/09/11
- Re: [Bug-gnubg] Some idle musings re. ratings, Joseph Heled, 2003/09/11
- Re: [Bug-gnubg] Some idle musings re. ratings, kvandoel, 2003/09/11
- Re: [Bug-gnubg] Some idle musings re. ratings, kvandoel, 2003/09/11
- Re: [Bug-gnubg] Some idle musings re. ratings, Joern Thyssen, 2003/09/12
- Re: [Bug-gnubg] Some idle musings re. ratings, kvandoel, 2003/09/12
- Re: [Bug-gnubg] Some idle musings re. ratings, Nardy Pillads, 2003/09/15