logoalt Hacker News

dryarzegtoday at 9:19 AM1 replyview on HN

As far as I'm aware, it's not true for models like DeepSeek or other Chinese open-weight models (at least those that I have seen); their reasoning traces are fully composed from some human language, be it English, Chinese or another one; by the way, most of them can adapt their reasoning based on user language, for example, if user speaks English the reasoning more likely will be in English.

I think that for DeepSeek problem (thinking and replying in Chinese) everything is kinda simpler: in their official chat, they're probably using some kind of system prompt which is (probably) written in Chinese, so that's why model may prefer Chinese in it's output.


Replies

calgootoday at 11:57 AM

I have seen mixed language thinking from claude when i speak to it in english but we are discussing a product thats in spanish or searching amazon spain.