Our model is larger and more capable than gpt-oss and runs at full context length at 40 tokens/s.
We are rolling out on macOS first, with plans to release Windows and Linux versions within 3 months.