logoalt Hacker News

ilakshtoday at 2:21 PM0 repliesview on HN

How long would it actually take to train a 120B model on an H200? What if you have 8?