XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54
XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X to play 54
On December 13, 2021 at 8:27:59 PM UTC-7, Tim Chow wrote:Quoting for archival purposes.
XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X to play 54
Will this position be in your next bg book?
You never written a previous bg book? Uh, sorry.
It's hard to believe you never written a bg book.
I would've guessed that you wrote many.
At least some of your opinions here should be
worth 99 cents if not 99 dollar or euros...??
Make it a hard cover. I don't mean cardboard.
I mean plywood. You will enjoy it more... ;)
MK
XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54
The rollout below agrees with Paul's play. However, XG 3-ply plays
24/20 6/1. For Stick's benefit, I should explain that when I say
that XG 3-ply plays 24/20 6/1, what I actually mean is that XG 3-ply
plays 24/20 6/1. But it's also true that the play with the highest
XG 3-ply evaluation---by a narrow margin---is 24/20 6/1.
XGID=--BBeCD---B--Aa----b-bbcA-:1:-1:1:54:0:0:0:0:10
X:Player 1 O:Player 2
Score is X:0 O:0. Unlimited Game
+13-14-15-16-17-18------19-20-21-22-23-24-+
| X O | | O O O O X | +---+
| | | O O O O | | 2 |
| | | O | +---+
| | | |
| | | |
| |BAR| |
| | | O |
| | | X O |
| | | X X O |
| X | | X X O X X |
| X | | X X O X X |
+12-11-10--9--8--7-------6--5--4--3--2--1-+
Pip count X: 106 O: 148 X-O: 0-0
Cube: 2, O own cube
X to play 54
The rollout below agrees with Paul's play. However, XG 3-ply plays
24/20 6/1. For Stick's benefit, I should explain that when I say
that XG 3-ply plays 24/20 6/1, what I actually mean is that XG 3-ply
plays 24/20 6/1. But it's also true that the play with the highest
XG 3-ply evaluation---by a narrow margin---is 24/20 6/1.
1. Rollout¹ 6/1 5/1 eq:+0.615
Player: 72.32% (G:32.32% B:0.79%)
Opponent: 27.68% (G:4.64% B:0.12%)
Confidence: ±0.007 (+0.608..+0.623) - [100.0%]
2. Rollout¹ 24/20 6/1 eq:+0.555 (-0.061)
Player: 71.46% (G:30.52% B:0.63%)
Opponent: 28.54% (G:6.90% B:0.22%)
Confidence: ±0.009 (+0.546..+0.563) - [0.0%]
3. Rollout¹ 24/15 eq:+0.517 (-0.098)
Player: 72.35% (G:23.38% B:0.51%)
Opponent: 27.65% (G:5.46% B:0.14%)
Confidence: ±0.007 (+0.511..+0.524) - [0.0%]
¹ 1296 Games rolled with Variance Reduction.
Dice Seed: 271828
Moves: 3-ply, cube decisions: XG Roller
I missed this episode of the Stick-and-Tim show. (Maybe I'll treat myself to the Netflix box set for Christmas.) Is it really the case that XG n-ply doesn't always
make the play with the highest XG n-ply evaluation. Could you possibly explain this
XGPly Paradox please?
On 12/16/2021 7:44 AM, peps...@gmail.com wrote:
I missed this episode of the Stick-and-Tim show. (Maybe I'll treat myself toFirst, as a bit of background, there are two main ways you'll
the Netflix box set for Christmas.) Is it really the case that XG n-ply doesn't always
make the play with the highest XG n-ply evaluation. Could you possibly explain this
XGPly Paradox please?
encounter XG 3-ply playing. First, if you want to play a match
against the bot and you select "eXtremeGammon" as your opponent,
then the bot will play at the XG 3-ply level. Second, if you
perform a rollout, the default setting is 3-ply/XGR, which means
that during a rollout trial, the bot will play the checkers at
the 3-ply level and the cube at the XGR level.
But what exactly does it mean to play the checkers at the 3-ply
level? What the bot first does is to evaluate every legal play
at the 1-ply level, by directly consulting the neural net. Then
it will select some subset of these to evaluate at the 2-ply
level, and finally it will select some subset of *those* to
evaluate at the 3-ply level. In particular, for most moves, it
will *not* actually evaluate every legal play at the 3-ply level.
It first filters out a bunch of moves whose 1-ply evaluation is
too low, and then it filters out more moves whose 2-ply evaluation
is too low. Finally, it evaluates the surviving candidates at the
3-ply level, picks the one with the highest equity, and plays it.
This procedure is usually quite sensible. Most of the time, if
a play's 1-ply evaluation is extremely far below that of the play
with the highest 1-ply evaluation, a 3-ply evaluation will not
overturn that verdict (it won't come up with the same equity
estimate as 1-ply does, but it will usually agree on which play
is better). So you save a lot of time by quickly filtering out
most of the moves immediately, and only investing further compute
time on the "promising" moves. (If you're worried that the bot
is filtering out moves too aggressively, then one thing you can
do, at least for rollouts, is to increase the size of the move
filter from "Normal" to "Large" or "Huge" or "Gigantic.")
You can see this visually if you set up a position with a lot of
legal plays, and then click on the 3-ply button near the lower
left corner of the screen. You'll see that next to each move is
an icon with 5 bars, and usually you'll see some plays at the
top with 3 bars filled in, followed by some plays with 2 bars
filled in, followed by yet more plays with only 1 bar filled in.
However, once in a while, a move that is filtered out at the 1-ply
or 2-ply level will, if forcibly evaluated at 3-ply, come out ahead
of the move that XG 3-ply plays.
Clear?
I have always said "XG 3-ply plays..." when I mean that that's
what XG 3-ply plays, either when it's playing at the eXtremeGammon
level against a human opponent, or when it's playing out a rollout
trial under the standard rollout settings. It's hard for me to
imagine a more natural meaning for "XG 3-ply plays." However, Stick
insists that the natural interpretation of "XG 3-ply plays..." is
*not* what XG 3-ply plays, but rather what move has the highest
XG 3-ply evaluation. To me, the natural way to refer to the move
with the highest XG 3-ply evaluation is "the move with the highest
XG 3-ply evaluation." But Stick continues to insist that I'm using
language in a very confusing manner.
Sysop: | Keyop |
---|---|
Location: | Huddersfield, West Yorkshire, UK |
Users: | 296 |
Nodes: | 16 (2 / 14) |
Uptime: | 37:38:12 |
Calls: | 6,648 |
Files: | 12,193 |
Messages: | 5,329,135 |