> We were cautious to only run after each model’s training cutoff dates for the LLM models
Grok is constantly training and/or it has access to websearch internally.
You cannot backtest LLMs. You can only "live" test them going forward.
Via api you can turn off websearch internally. We provided all the models with their own custom tools that only provided data up to the date of the backtest.
Via api you can turn off websearch internally. We provided all the models with their own custom tools that only provided data up to the date of the backtest.