Seeing half of an AR LLM's output tokens go to generating a predefined json schema bothers me s...

fumeux_fume • today at 3:22 PM • 1 reply

Seeing half of an AR (autoregressive) LLM's output tokens go to generating a predefined JSON schema bothers me so much. I would love to have an option to use diffusion for infilling.

Replies

jmalicki • today at 3:48 PM

One trick I learned for this was to use CSV for LLM I/O and translate JSON <-> CSV at the boundary layer.
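
A rough sketch of what that boundary layer could look like, assuming flat records with a fixed set of fields. The function names (`json_to_csv`, `csv_to_json`) and the `schema` mapping are made up for illustration and are not from any particular library; only the standard `csv`, `io`, and `json` modules are used.

```python
import csv
import io
import json


def json_to_csv(json_text, fields):
    """Turn a JSON array of flat objects into CSV text to send to the model."""
    records = json.loads(json_text)
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=fields, lineterminator="\n")
    writer.writeheader()
    writer.writerows(records)  # raises ValueError on keys not in `fields`
    return buf.getvalue()


def csv_to_json(csv_text, schema):
    """Turn CSV text from the model back into a JSON array.

    `schema` (hypothetical) maps each expected column to a cast such as
    str or int, because CSV carries no type information.
    """
    reader = csv.DictReader(io.StringIO(csv_text.strip()))
    if reader.fieldnames != list(schema):
        raise ValueError(f"unexpected header: {reader.fieldnames}")
    records = [
        {name: cast(row[name]) for name, cast in schema.items()}
        for row in reader
    ]
    return json.dumps(records)
```

The keys are then written once in the header row instead of once per object, which is where the token savings would come from. This only works as written for flat records; nested JSON would need flattening first, and a real version would want to handle malformed rows from the model.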
