logoalt Hacker News

JPLeRouzictoday at 1:51 PM1 replyview on HN

Has anyone started to implement this technique in Llama.cpp or similar inference tool?


Replies

dnhkngtoday at 1:59 PM

There was some work done on this a while back, during the FrankenMerge craze of 23'

I am working with TurboDerp to integrate this into the Exllama v3 format.

show 1 reply