logoalt Hacker News

hammeiamyesterday at 11:47 PM2 repliesview on HN

Sparse Attention, it's the highlight of this model as per the paper


Replies

culitoday at 1:29 AM

How did we come to the place that the most transparent and open models are now coming out of China—freely sharing their research and source code—while all the American ones are fully locked down

show 4 replies
pylotlighttoday at 12:38 AM

I'll have to wait for the bycloud video on this one :P