logoalt Hacker News

mlmonkeyyesterday at 11:32 PM1 replyview on HN

> We were cautious to only run after each model’s training cutoff dates for the LLM models

Grok is constantly training and/or it has access to websearch internally.

You cannot backtest LLMs. You can only "live" test them going forward.


Replies

cheeseblubberyesterday at 11:45 PM

Via api you can turn off websearch internally. We provided all the models with their own custom tools that only provided data up to the date of the backtest.

show 1 reply