logoalt Hacker News

naaskingtoday at 2:13 AM1 replyview on HN

The linked paper tested nanoGPT with this new transformer:

https://www.techrxiv.org/users/685780/articles/1375955-topol...


Replies

tunedtoday at 6:14 AM

thanks for linking.

Yes the paper compares the new architecture (that is also a fork of my implementation of nanoGPT) with Karpathy's nanoGPT. There are also links to the code and bench used.