Re: [Bug-gnubg] Re: Strange FIBS ratings

bug-gnubg

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-gnubg] Re: Strange FIBS ratings

From:	Christopher D. Yep
Subject:	Re: [Bug-gnubg] Re: Strange FIBS ratings
Date:	Tue, 09 Sep 2003 06:58:20 -0400

At 07:04 PM 9/8/2003 +0200, Jim Segrave wrote:

On Mon 08 Sep 2003 (14:52 +0000), Joern Thyssen wrote:
> On Mon, Sep 08, 2003 at 03:03:55PM +0200, Jim Segrave wrote
> > On Mon 08 Sep 2003 (11:04 +0000), Joern Thyssen wrote:
> > > On Mon, Sep 08, 2003 at 11:36:33AM +0200, Jim Segrave wrote
> > >

> > > Kees' experiments show that cube decisions errors don't weigh asmuch as

> > > chequer play errors. I can't offer any explanation for this, other than
> > > gnubg's chequerplay is much better than the cube play???
> >
> >

I think this phenomenon has been known for many years now. Kees'experiments and Douglas Zare's research on Gammonvillage are just thelatest examples supporting this conclusion.

While the largest errors in a money game or match are likely to be cubeerrors (which causes most players to mistakenly believe that the totalequity given up by cube errors dominates the total equity given up bychecker errors), the total equity given up by checker errors is larger (onaverage) since there are more checker errors per game. For example it'spossible to make a checker error on every turn of the game, while much ofthe game one doesn't even have cube access (i.e. opponent owns thecube). Even if we only consider non-trivial decisions, there are a lotmore non-trivial checker decisions than non-trivial cube decisions pergame. Note also that it's only possible to make 1 wrong pass per game, afew wrong doubles per game (it would be rare for one player to make morethan 1 wrong double in a given game), a few wrong takes per game (againit's rare to make more than 1 in a given game), and several missed doublesper game.

David Montgomery's excellent article, archived athttp://www.bkgm.com/rgb/rgb.cgi?view+455, gives some general arguments asto why checker play is difficult.

> > >From a comment in the thread on GammOnLine:
> >
> > ================
> >
> > seems to me that checker play errors in real matches represent are
> > always an irretrievable loss of equity, while cube errors may or may
> > not matter, depending on the flow of the game (5 missed marginal
> > doubles with an eventual correct double/take), and opponent's error
> > (too good to double, but he took.) Objective cube errors may not even
> > be errors (there's little play-the-opponent in checker play, but a lot
> > in cube action). Further, it seems that cube errors against weaker
> > opponents are relatively less costly than cube errors against stronger
> > opponents (against a weakie I can recover from a bad take and gammon
> > loss in the first game of a 5-point match, or choose to play the whole
> > match semi-cubeless, or take "passes" that opponent's checker errors
> > make takes -- in gnu's eyes I'll be a "casual cubist" in all cases).
> >
> > ================
>

The above is a common (but incorrect argument). The same argument could bemade about checker errors in repeating positions, but it's wrong.


Consider an example:

1. If a player makes a 0.1 error (checker or cube), both players then playperfectly, the original position later repeats (with the same dice roll ifthe original error was a checker error), and the player then makes thecorrect play (checker or cube), this doesn't mean that his original errorshouldn't count as an error. One way to think of it is that maybe therewas a 50% chance that his original error would "cost" 0.2 and 50% that itwould "cost" 0 (i.e. the position would essentially repeat). Rather thanbase the error on the actual future dice rolls, it's better and proper tojust give it a 0.1 error. This is what gnubg, Snowie, etc. do. Thisapplies for both checker and cube errors.


Some specific examples:

2. Bearoff, Player 0 and Player 1 both have a 2-roll position (i.e. 3 or 4each on the acepoint). Player 0 owns the cube (normalized to 1).

Suppose that player 0 fails to double. How much of an error is this(answer: .278)? The gammonline argument says that it's an error of -2.0(the difference between winning 1 and losing 1) 13.9% (5/6 * 1/6) of thetime and an error of 0.0 86.1% of the time (player 0 can cash on the nextturn 86.1% of the time if he forgets to cash on the first roll). Whilethis also averages to .278, most players don't think of equity this way(nor should they).


3. Bearoff, both players have 5-roll position, cube centered, player 0 on roll.

Correct cube action is marginal double, easy take. Not doubling is a .023error. The gammonline argument says that if both players roll non-doubles(so that player 0 can double-in next turn), then the original error wasn'treally an error and should be counted as 0, but if other sequences happenthen count the error as something else (including counting the error as a"negative error" if player 0 rolls non-doubles followed by player 1 rollingdoubles!). While this again averages to .023, it's the wrong way toconsider equity.

4. Bearoff, player 0 has checkers on 1,2; player 1 has a 2-rollposition. Player 1 owns the cube (normalized to 1). Player 0 onroll. Player 0 rolls 6-1 and plays 2/1, 1/off instead of 2/off1/off. This is an error of .333 (equity of 5/6 - 1/6 = 2/3, instead of1). It's right to consider this a .333 error instead of a 0.0 error 5/6 ofthe time and a 2.0 error 1/6 of the time.


A more generalized example:

5. Player 0 has a medium to large race lead and is playing against a20-point holding game. Suppose proper cube action is double/take (and islikely to remain so for many rolls). Suppose player 0 (a beginner) doesn'tknow to double (and makes the same mistake on all future rolls). Let'salso assume that he does know to double when he clears his midpoint orleaves a single checker on his midpoint which is missed. For simplicityassume that in all games player 0 will roll doubles to clear the midpointor will leave a shot (and get hit or missed). Suppose that overall, player0's late-double strategy costs .30 in equity. The gammonline argument saysto: (a) penalize him zero when he doesn't double then rolls a non-double;(b) penalize him a large amount when he doesn't double then rolls doubles(which is a market loser); (c) penalize him a negative amount when hedoesn't double, leaves a number which forces him to leave a shot, and ishit; and (d) penalize him a large amount when he doesn't double, leaves anumber which forces him to leave shot, and is missed (this is a marketloser). The (correct) approach that gnubg (and Snowie, etc.) use is topenalize each missed double a certain amount (e.g. .05). In some games thebeginner will clear his midpoint after his first missed double (for thatgame gnubg will assign him a total error of only .05), while in other gameshe may roll a large number (e.g. 12) of non-doubles before clearing hismidpoint or leaving a shot (for that game gnubg will assign him a totalerror of .60). However, on average the beginner will be credited witherrors totalling .30 for this late-double strategy.

Also, the side comments about cube decisions which are errors in theory,but not in practice (i.e. if the opponent is a stronger/weaker player) arerelevant, but let's ignore them for the point of this discussion. gnubgwith (and without) noise doesn't try to play based on the strength of theopponent, yet we observed that it still gives up much more in checkererrors than cube errors. Two equally strong humans (who know they areequally strong) also give up much more in checker errors than in cubeerrors. The gammonline side comments support the idea that if a human (whois consciously adjusting for skill) plays a match, then analyzes it withgnubg, his reported cube error rate will be higher than it "should" be inpractice. (BTW, I agree with this statement.)

In the case of missed doubles, you may not lose your market and you
will get a chance to recover your mistake on the next move?

See example 3 (5-roll position). Rather than assigning different errorsbased on future dice rolls, it's proper to just assign an error of .023 inthe original position.


Chris

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-gnubg] RE: Strange FIBS ratings, (continued)

Prev by Date: RE: [Bug-gnubg] Re: The importance of METs
Next by Date: [Bug-gnubg] Win32 030908 crashes
Previous by thread: Re: [Bug-gnubg] Re: Strange FIBS ratings
Next by thread: Re: [Bug-gnubg] Re: Strange FIBS ratings
Index(es):
- Date
- Thread