LLMs should be trained on and directly output binary.

dyauspitr • today at 5:42 AM • 7 replies • view on HN

Replies

On the off chance that you’re serious, that would result in disastrously bad output. The difference between “jmp $+15” and “jmp $+16” is inscrutable and the LLM would not be able to pick the right one without tooling.

That tooling is a compiler. The higher level, the better chance the LLM can be steered to good output. Machine code is hopeless, don’t bother.

➕ show 3 replies

xiaoyu2006 • today at 5:52 AM

It should not. Abstraction in software engineering brings intelligence. (compression correlates to intelligence)

➕ show 3 replies

bandrami • today at 6:11 AM

Generative algorithms have been studied for decades now and while they have led to some interesting results they're a bad fit for LLMs because there's no such thing as a "plausible" binary: a small perturbation yields an unusable result.

fulafel • today at 6:23 AM

Technically they are, just a subset. But still a practical one, they're frequently used to produce executable files.

junior44660 • today at 6:57 AM

[flagged]

wahnfrieden • today at 6:01 AM

[flagged]

rvz • today at 5:53 AM

I think you forgot the "/s"

alt Hacker News

Replies