logoalt Hacker News

hedgehogyesterday at 10:24 PM0 repliesview on HN

Not strange, for the kind of applications models at that size are often used for the prefill is the main factor in responsiveness. Large prompt, small completion.