The author claimed that the models he modified with this layer repetition method topped the huggingf...

v9v • yesterday at 4:57 PM • 1 reply • view on HN

The author claimed that the models he modified with this layer repetition method topped the huggingface open llm leaderboard in his first post: https://dnhkng.github.io/posts/rys/

Do you remember the names of the previous experiments done on this? Would love to take a look.

Replies

vibe42 • yesterday at 5:26 PM

Just learned about it the other day from this thread from Feb, 2024: https://old.reddit.com/r/LocalLLaMA/comments/1aqrd7t/i_made_...

Has some interesting github links.

alt Hacker News

Replies