logoalt Hacker News

embedding-shapetoday at 9:02 AM1 replyview on HN

Almost slipping into conspiracy territory, but without insights into what the labs actually do internally, hard not to:

Anyways, heard about A/B testing before? ML people tend to like it a lot, hard to imagine neither OpenAI or Anthropic are already deep into categorizing people into buckets and running an wild amount of A/B testing all over the place, especially in the weeks leading up to new model releases, in various ways.


Replies

user43928today at 9:23 AM

Yes, and we can see A/B testing on the ChatGPT website all the time.

They are also testing the new models in their coding tools with select customers first.

People working at OpenAI have publicly denied that they are performing any kind of hidden routing or quantization of models after release for Codex. I tend to believe them.