alt
Hacker News
ilaksh
•
today at 2:21 PM
•
0 replies
•
view on HN
How long would it actually take to train a 120B model on an H200? What if you have 8?