logoalt Hacker News

fragmedetoday at 12:11 AM0 repliesview on HN

What does it do when the model wants to return something else, and what's better/worse about doing it in llamafile vs whatever wrapper that's calling it? How do I set retries? What if I want JSON and a range instead?