Per-iteration adversarial training progress
Here we show games from each iteration of our iterated adversarial training procedure. At each iteration, the victim v
n
is generated by fine-tuning the previous victim v
n-1
against the previous adversary a
n-1
, and the adversary a
n
is generated by fine-tuning the previous adversary a
n-1
against the victim v
n
. The initial victim v
0
is KataGo network checkpoint b40c256-s11840935168-d2898845681
, the network we attacked with our original cyclic adversary (base-adversary
). The initial adversary a
0
is the original cyclic adversary.
a
0
vs. v
0
Against victim v
0
, a
0
achieves win rates of 100% at 16 victim visits, 99% at 256 visits, and 99% at 4096 visits.
Victim: v
0
, 16 visits
Adversary: a
0
a
0
vs. v
1
Against victim v
1
, a
0
achieves win rates of 13% at 16 victim visits, 1% at 256 visits, and 1% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -53.0
Victim: v
1
, 16 visits
Adversary: a
0
a
1
vs. v
1
Against victim v
1
, a
1
achieves win rates of 99% at 16 victim visits, 86% at 256 visits, and 69% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -22.7
Victim: v
1
, 16 visits
Adversary: a
1
a
1
vs. v
2
Against victim v
2
, a
1
achieves win rates of 34% at 16 victim visits, 4% at 256 visits, and 0% at 4096 visits.
adversary predicted win prob: 1.00 loss: 0.00, predicted score: 68.5
Victim: v
2
, 16 visits
Adversary: a
1
a
2
vs. v
2
Against victim v
2
, a
2
achieves win rates of 99% at 16 victim visits, 95% at 256 visits, and 56% at 4096 visits.
adversary predicted win prob: 1.00 loss: 0.00, predicted score: 210.6
Victim: v
2
, 16 visits
Adversary: a
2
a
2
vs. v
3
Against victim v
3
, a
2
achieves win rates of 13% at 16 victim visits, 4% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -176.5
Victim: v
3
, 16 visits
Adversary: a
2
a
3
vs. v
3
Against victim v
3
, a
3
achieves win rates of 88% at 16 victim visits, 67% at 256 visits, and 33% at 4096 visits.
adversary predicted win prob: 1.00 loss: 0.00, predicted score: 52.3
Victim: v
3
, 16 visits
Adversary: a
3
a
3
vs. v
4
Against victim v
4
, a
3
achieves win rates of 18% at 16 victim visits, 0% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -10.0
Victim: v
4
, 16 visits
Adversary: a
3
a
4
vs. v
4
Against victim v
4
, a
4
achieves win rates of 94% at 16 victim visits, 54% at 256 visits, and 25% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -14.3
Victim: v
4
, 16 visits
Adversary: a
4
a
4
vs. v
5
Against victim v
5
, a
4
achieves win rates of 9% at 16 victim visits, 0% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -23.0
Victim: v
5
, 16 visits
Adversary: a
4
a
5
vs. v
5
Against victim v
5
, a
5
achieves win rates of 51% at 16 victim visits, 3% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -12.1
Victim: v
5
, 16 visits
Adversary: a
5
a
5
vs. v
6
Against victim v
6
, a
5
achieves win rates of 10% at 16 victim visits, 1% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -6.6
Victim: v
6
, 16 visits
Adversary: a
5
a
6
vs. v
6
Against victim v
6
, a
6
achieves win rates of 38% at 16 victim visits, 9% at 256 visits, and 2% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -17.4
Victim: v
6
, 16 visits
Adversary: a
6
a
6
vs. v
7
Against victim v
7
, a
6
achieves win rates of 26% at 16 victim visits, 1% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -10.6
Victim: v
7
, 16 visits
Adversary: a
6
a
7
vs. v
7
Against victim v
7
, a
7
achieves win rates of 94% at 16 victim visits, 47% at 256 visits, and 32% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -38.6
Victim: v
7
, 16 visits
Adversary: a
7
a
7
vs. v
8
Against victim v
8
, a
7
achieves win rates of 16% at 16 victim visits, 0% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -34.9
Victim: v
8
, 16 visits
Adversary: a
7
a
8
vs. v
8
Against victim v
8
, a
8
achieves win rates of 24% at 16 victim visits, 0% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -21.0
Victim: v
8
, 16 visits
Adversary: a
8
a
8
vs. v
9
Against victim v
9
, a
8
achieves win rates of 6% at 16 victim visits, 2% at 256 visits, and 0% at 4096 visits.
victim predicted win prob: 1.00 loss: 0.00, predicted score: 366.5
Victim: v
9
, 16 visits
Adversary: a
8
a
9
vs. v
9
Against victim v
9
, a
9
achieves win rates of 99% at 16 victim visits, 94% at 256 visits, and 59% at 4096 visits.
victim predicted win prob: 0.00 loss: 1.00, predicted score: -60.1
Victim: v
9
, 16 visits
Adversary: a
9
victim predicted win prob: 0.00 loss: 1.00, predicted score: -108.2