logoalt Hacker News

thanhhaimaiyesterday at 5:48 PM3 repliesview on HN

This is the PR with the changes in case people missed it:

https://github.com/mieciu/tau2-bench/pull/1/files


Replies

nitwit005yesterday at 11:31 PM

That seems so strongly directed, that it feels like an attempt to reproduce a classic chat bot.

catlifeonmarstoday at 6:47 AM

Can one customer get the model to return the bill details for another customer?

blndrtyesterday at 7:37 PM

Thanks! I also updated the post with the link on the website.