logoalt Hacker News

gchamonlivetoday at 4:07 PM0 repliesview on HN

Models are lossy, so fine-tune can only take you so far with small models. What we need is reasonably capable local models with a huge context window and a method to make efficient use of token and cram as much info as possible in the context before degrading the output quality.