Hacker News

ummonk, today at 6:51 PM

I don't see why the transformer architecture can't be designed and trained with separate inputs for control data and content data.
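
One possible reading of "separate inputs", sketched here purely as an assumption and not as anything the comment specifies: keep a single token stream, but accept control and content tokens through separate arguments and add a learned per-channel embedding to each token, similar in spirit to segment embeddings. The sizes, the random tables standing in for learned weights, and the `embed` helper are all hypothetical.

```python
import random

random.seed(0)

VOCAB_SIZE = 100         # hypothetical vocabulary size
D_MODEL = 8              # hypothetical embedding width
CONTROL, CONTENT = 0, 1  # channel ids (an assumption of this sketch)


def random_table(rows, cols):
    """Stand-in for a learned embedding table, filled with Gaussian noise."""
    return [[random.gauss(0.0, 1.0) for _ in range(cols)] for _ in range(rows)]


token_emb = random_table(VOCAB_SIZE, D_MODEL)  # one row per token id
channel_emb = random_table(2, D_MODEL)         # one row per channel


def embed(control_ids, content_ids):
    """Embed two separately supplied sequences, tagging each token with its channel."""
    tagged = [(t, CONTROL) for t in control_ids] + [(t, CONTENT) for t in content_ids]
    # Each input vector is the token embedding plus its channel embedding.
    vectors = [
        [tok + ch for tok, ch in zip(token_emb[t], channel_emb[c])]
        for t, c in tagged
    ]
    channels = [c for _, c in tagged]
    return vectors, channels


# Token id 5 appears in both channels but receives a different input vector in each.
vectors, channels = embed([5, 6, 7], [5, 6])
```

This only marks which channel each token arrived on. Whether a model actually treats the two channels differently would depend on how it is trained, which the sketch does not address.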


Replies

amw-zero, today at 6:53 PM

Give it a shot