logoalt Hacker News

Bringing Up DeepSeek-V4-Flash on AMD MI300X

72 pointsby kkmyesterday at 5:52 PM6 commentsview on HN

Comments

maCDzPyesterday at 8:49 PM

I train on AMD MI250X and managed to get Gemma 4 31B to work - but it took a lot of work on the software side.

show 1 reply
mezarkyesterday at 7:31 PM

We at doubleword are bullish for AMD for low-interactivity inference - it does just take a bigger lift on the software side...

show 1 reply
kkmyesterday at 7:30 PM

Also the vllm patch accompanying the blogpost: https://github.com/doublewordai/vllm-amd-blog-doubleword

benlmyesterday at 7:23 PM

Nice work! Would DeepSeek V4 Pro on 8xMI300X work with these patches?