alt Hacker News

cristoperb • yesterday at 7:55 PM • 1 reply

Apparently it is the same as the DeepseekV3 architecture and already supported by llama.cpp once the new name is added. Here's the PR: https://github.com/ggml-org/llama.cpp/pull/18936
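
To illustrate why "adding the new name" can be enough, here is a minimal sketch of a name-to-implementation registry. It is not llama.cpp's actual code: `register`, `converter_for`, `DeepseekV3Converter`, and the placeholder name `NewModelForCausalLM` are all hypothetical, and the real change is whatever the linked PR contains.

```python
# Hypothetical sketch, NOT llama.cpp's real code: a registry that maps
# architecture name strings to the class that knows how to handle them.
_REGISTRY = {}


def register(*names):
    """Class decorator that registers one implementation under several names."""
    def wrap(cls):
        for name in names:
            _REGISTRY[name] = cls
        return cls
    return wrap


# Supporting a model that reuses an existing architecture only needs one more
# name in this list. "NewModelForCausalLM" is a made-up placeholder.
@register("DeepseekV3ForCausalLM", "NewModelForCausalLM")
class DeepseekV3Converter:
    def describe(self):
        return "deepseek-v3 graph"


def converter_for(arch_name):
    """Look up the implementation for an architecture name from a model config."""
    try:
        return _REGISTRY[arch_name]()
    except KeyError:
        raise ValueError(f"unsupported architecture: {arch_name}") from None


if __name__ == "__main__":
    # Both names resolve to the same implementation.
    print(converter_for("NewModelForCausalLM").describe())
```

Under this assumption, a checkpoint with an unregistered name fails at lookup even though the code that could run it already exists, which is why a one-line alias is all that is missing.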

Replies

khimaros • today at 3:17 AM

The PR has been merged.