logoalt Hacker News

billyloyesterday at 12:43 PM0 repliesview on HN

If you are curious about doing something similar with TPU, Google has an article. https://developers.googleblog.com/train-gpt2-model-with-jax-...