logoalt Hacker News

Efficient Code Search with Nvidia DGX

19 pointsby simplesort04/24/20251 commentview on HN

Comments

macleginn04/24/2025

I wonder where the label ‘mini/micro’ batch came from (‘Training at bfloat16 numeric precision enabled them to use large micro-batch sizes of 256...’), given that batches were never that big to begin with.