Sparse Attention, it's the highlight of this model as per the paper

hammeiam • yesterday at 11:47 PM • 2 replies • view on HN

Replies

How did we come to the place that the most transparent and open models are now coming out of China—freely sharing their research and source code—while all the American ones are fully locked down

➕ show 4 replies

pylotlight • today at 12:38 AM

I'll have to wait for the bycloud video on this one :P

alt Hacker News

Replies