logoalt Hacker News

orphyesterday at 9:10 PM1 replyview on HN

Why not apply changes to the underlying model so that you crush every available eval?


Replies

cgorllayesterday at 10:58 PM

SOTA results are a happy byproduct of the core mission of our approach, which is to enable the effective and simple translation of policy documents into a model without having to fine-tune and prompt engineer. This performance is somewhat unexpected but also sensical, so we're still trying to figure out the best way to harness it. That may include releasing model artifacts in the future.