Is there a paper on this? I'm curious how they pre-trained it... I feel like it must have had...

spott • today at 5:35 PM • 1 reply • view on HN

Is there a paper on this?

I'm curious how they pre-trained it... I feel like it must have had audio/image output that they chopped off.

I wonder how hard it would be to add it back on.

joaogui1 • today at 5:40 PM

I mean Claude is multimodal on input but not output, why couldn't this also be?

alt Hacker News