How come they didn’t ablate encoder? OpenAI GOT models are decoder only.
If you read the Wired article linked elsewhere on this thread, then it explains that. The work was being done by people from the Google Translate team.
If you read the Wired article linked elsewhere on this thread, then it explains that. The work was being done by people from the Google Translate team.