spidy__ • last Tuesday at 3:35 PM

I tried it on a 100 MB slice of enwik9 and was able to compress it to 20 MB, plus a 900 KB transformer, so about 21 MB total.

I know the top submission was able to get it down to 13 MB.

Still trying some ideas to get better compression.
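
For anyone checking the arithmetic, here is a rough sketch of the size accounting, since the model has to be counted as part of the compressed output. It assumes decimal units (1 MB = 1,000,000 bytes) and assumes the 13 MB figure refers to the same 100 MB input; the function names are made up for illustration.

```python
# Rough size accounting; decimal units are an assumption.
MB = 1_000_000
KB = 1_000


def total_size(payload_bytes: int, model_bytes: int) -> int:
    """Compressed payload plus the model needed to decode it."""
    return payload_bytes + model_bytes


def bits_per_byte(compressed_bytes: int, original_bytes: int) -> float:
    """Average number of output bits spent per input byte."""
    return 8 * compressed_bytes / original_bytes


original = 100 * MB
mine = total_size(20 * MB, 900 * KB)   # 20 MB payload + 900 KB transformer
top = 13 * MB                          # assumed to be on the same input

print(f"total: {mine / MB:.1f} MB")
print(f"mine:  {bits_per_byte(mine, original):.3f} bits per byte")
print(f"top:   {bits_per_byte(top, original):.3f} bits per byte")
```

Under those assumptions the total is 20.9 MB, so "21 MB" is the rounded figure.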

Replies

Since you know the size of the file beforehand, you may be able to overfit some kind of text diffusion model instead of a transformer. That may allow you to partially correct the model output using some other method, and then fill in the blanks that were wrong in previous generations.
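
A toy sketch of the "correct the output, then fill in the blanks" part, assuming the model's guess has the same length as the original text. This is not a diffusion model, just a stand-in showing that only the wrong positions need to be stored; `make_patch` and `apply_patch` are hypothetical helper names.

```python
# Toy "guess plus corrections" scheme; the guess stands in for model output.

def make_patch(original: str, guess: str) -> list[tuple[int, str]]:
    """Record (position, correct character) wherever the guess is wrong."""
    if len(original) != len(guess):
        raise ValueError("guess must have the same length as the original")
    return [(i, c) for i, (c, g) in enumerate(zip(original, guess)) if c != g]


def apply_patch(guess: str, patch: list[tuple[int, str]]) -> str:
    """Rebuild the original by overwriting the wrong positions."""
    chars = list(guess)
    for i, c in patch:
        chars[i] = c
    return "".join(chars)


original = "hello world"
guess = "hellp wxrld"            # pretend this came from the model
patch = make_patch(original, guess)
restored = apply_patch(guess, patch)
print(patch, restored == original)
```

The better the guess, the shorter the patch, so the stored size would be roughly the model plus the corrections.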

