AlphaZero trained to play Othello using Jax and PGX. I used a TPU v4-8 provided by the TensorFlow Research Cloud to build this. Currently, we only have a checkpoint for steps 13270 and 15154, but we will have better models soon. Model evaluations:

Step Win % vs PGX baseline Draw % vs baseline Lose % vs baseline
13270 ~46.8% 6.25% ~46.8%
15154 62.5% 0% 37.5%
17039 81.25% 3.125% 15.625%
22190 87.5% 0% 12.5%
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.