Sure. Claude does that. "Cogitated for 1m 50s" doesn't work for real-time application...

ceejayoz • today at 1:05 PM • 1 reply • view on HN

Sure. Claude does that. "Cogitated for 1m 50s" doesn't work for real-time applications.

You can submit many queries in parallel to increase throughout. Smaller models and faster hardware can reduce the time per query too.

➕ show 2 replies

alt Hacker News