logoalt Hacker News

toomuchtodoyesterday at 10:10 PM1 replyview on HN

You decouple the workloads from human interaction (ie when you submit the job to the queue vs when it is scheduled to execute) so when they run is not a consideration, if possible. The economic incentives encourage solving this, and if it can’t be solved, it buckets customer cohort by willingness (or unwillingness) to pay for access during peak times.


Replies

stavrosyesterday at 10:13 PM

Sure, but if I ask the LLM a question, I'd like it to respond now, instead of tonight.

show 1 reply